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Abstract 

We propose an elasticity theory for one and two dimensional arrays of globular proteins for which 
the free energy is affected by relative position and relative rotation between neighboring molecules. 
The kinematics of such assemblies is described, the conditions of compatibility are found, a form 
of the free energy is given, and formulas for applied forces and moments are developed. It is 
shown that fully relaxed states of sheets consist of helically deformed sheets which themselves are 
composed of helical chains of molecules in rational directions. 

We apply the theory to the fascinating contractile deformation that occurs in the tail sheath of 
the virus bacteriophage T4, which aids its invasion of its bacterial host. Using electron density 
maps of extended and contracted sheath, we approximate the domains of each molecule by ellipsoids 
and then evaluate our formulas for the position and orientation of each molecule. We show that, 
with the resulting kinematic description, the configurations of extended and contracted tail sheath 
are generated by a simple formula. We proposed a constrained version of the theory based on 
measurements on extended and contracted sheath. Following a suggestion of Pauling [Discussions 
of the Faraday Society 13, 170-6 (1953)], we develop a simple model of the molecular interaction. 
The resulting free energy is found to have a double-well structure. Certain simple deformations 
are studied (tension, torsion inflation); the theory predicts a first-order Poynting effect and some 
unexpected relations among moduli. Finally, the force of penetration is given, and a possibly 
interesting program of epitaxial growth and patterning of such sheets is suggested. 
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I. INTRODUCTION 

A remarkably large number of biological structures are composed of identical protein 
molecules, or mixtures of a few different protein molecules, in regular arrays. Examples are 
microtubules, bacterial flagella, F-actin filaments and viral coats. These are different from 
typical inorganic crystals in that the individual molecules are composed of many atoms 
and the whole array is typically not a 3-D crystal but is often a single molecule thick 
regular array on a sheet, either flat, curved or polyhedral, or else in a linear chain. Often 
the latter adopt helical forms, and, in the example below of bacteriophage T4 tail sheath, 
the cylindrical sheath is composed of two families of helical chains of proteins. Besides 
the reduced dimensionality and natural curvature, the protein-protein interactions involve 
one or more bonding sites with groups of bonded atoms distributed over the bonding site. 
Because of this, the interactions can be complex (from a first principles' viewpoint) and 
proteins exert both forces and moments on each other. However, a simplifying feature of 
the interactions is that individual protein molecules in such arrays predominantly interact 
only with nearest neighbors. 

With the rapid development of optical tweezers and atomic force microscopes 
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has become possible to subject a protein structure to a force or moment and to measure 
its elastic response. These experiments seem to be often interpreted in terms of classical 
macroscopic theories of elasticity. For example, Kirchhoff's rod theory is often used to 
interpret experiments on chains such as DNA p, jj], 0. As discussed by these authors, 
Kirchhoff's rod theory is expected to be valid when the length of the chain is much larger 
than its radius of curvature, and it has been used successfully in such cases. But Kirchhoff's 
rod theory is built on certain assumptions relating to the macroscopic theory of nonlinear 
elasticity. Thus, for example, in its simplest anisotropic form, three suitable bending-torsion 
experiments suffice to determine the moduli, meaning that the mechanical behavior in all 
subsequent experiments is then determined. In addition, with only a few molecules (138 in 
the case of T4 tail sheath) or with localized large curvatures, a molecular elasticity theory 
may be needed. 



For these reasons we develop here an elasticity theory that is suitable for these protein 
arrays. The proteins themselves have irregular yet well-defined shapes and they interact 
via localized bonding sites. We steer a course midway between detailed first principles 
calculations on the given protein sequence (which, at present, would leave us stuck on the 
protein folding problem) and macroscopic nonlinear elasticity. First, each molecule is given 
a position and orientation. We explain how position and orientation are related to detailed 
structure, in a manner that is consistent with the results of first principles calculations. Our 
definition is different from the usual one, but seems to have some advantages. It also has the 
property that the position and orientation are given by a translation and rigid rotation of a 
reference molecule that best approximates, in a least squares sense, the deformed molecule. 
As for interactions, we focus on pairwise forces and moments, because bonding between 
globular proteins is often localized at the region of contact between two proteins. Because 
we have an orientation variable as well as a positional one, our theory also has points of 
contact with the theory of liquid crystals, but in the end the theory is quite different. 

We propose a form of the free energy based on these ideas (Section IHIj) . Imposing the 
condition of frame-indifference, we define certain "strain variables" upon which the free en- 
ergy depends. Conditions of compatibility concern the extent to which these strain variables 
can be assigned independently: for sheets, we find that necessary and sufficient conditions 
for compatibility (of a simply connected sheet, defined precisely below) are that a certain 
pair of sums of four terms vanishes. These relate to the process of checking compatibility 
around elementary squares consisting of four molecules. We find these conditions to be very 
useful. 

For both chains and sheets, helical configurations arise as the natural ordered structures. 
In particular, under general conditions they are the free energy minimizers in the absence of 
boundary conditions. These results can be considered as analytical expressions of the ideas 
of Crane |g] . Ultimately the reason is the same reason that these configurations are natural 
in nonlinear elasticity (Ericksen j)]). To oversimplify, in a helical configuration any two parts 
of the chain, of equal monomer length, are related by a Galilean transformation. Therefore, 
the equilibrium of a short section implies, via rotation and translation, that the whole helix 
is in equilibrium. In the present context they play a deeper role: fully relaxed states of a 
sheet governed by our free energy consist of helically deformed sheets which themselves are 




FIG. 1: (a) Structure of bacteriophage T4, based on electron microscope structure analysis to a 
resolution of about 2 to 3 nm, from [6|. Reproduced with permission of Fred Eiserling. (b) A cross 
section showing concentric tail tube and sheath annuli, (c) Schematic of contraction process [^J. 

composed of helical chains of molecules (Section |IXJ). 

Using this theory, we study of deformations of the tail sheath of bacteriophage T4, Figure 
^ Bacteriophages are viruses that attack bacteria. The T4 virus is composed of a capsid 
containing the viral DNA (Figure EJi) and a tail shown extending down from the capsid. 
The tail consists of a pair of concentric cylinders (Figure QJd) each about 1000 A long. The 
inner tail tube with a diameter of about 90 A is surrounded by the tail sheath with an outer 
diameter of about 240 A. The sheath is composed of six parallel helices, made from chains 
of a single type of protein. Although the tail sheath is single molecular layer protein sheet, 
it should not be considered (as is assumed by nonlinear plate theory) thin relative to its 
radius of curvature, this ratio being (thickness/mean radius of curvature) ~ 1. 

Prior to invasion of the host, the sheath proteins are arranged as steeply pitched helices, 
and the tail adopts so-called extended structure. During the virus' attack on a bacterium, 
the tail sheath changes shape dramatically; the protein helices compress, causing the sheath 
to shorten and fatten into a more compact contracted structure. This drives the relatively 
rigid inner tail tube through the cell wall, making a passage for the viral DNA to pass into 
the host (Figure^). During this process the sheath contracts irreversibly to about 1/3 of 
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its original length accompanied by a 50% increase in outer diameter. The transformation 
has many features in common with martensitic phase transformations, as has been noted by 
Olson and Hartman pj. For a general review of T4 tail structure and function see Coombs 
and Arisaka. [l(3] 

Once the viral DNA is inside the host, it reprograms the host cell to produce all ingredients 
needed to form new viruses. The viral protein molecules produced by this process self- 
assemble into virus progeny within the host, eventually causing it to burst, releasing the 
viruses to infect other hosts. 

We apply the theory of the protein sheet to the tail sheath of bacterio phag e T4. Using 
measured electron density maps of extended and contracted sheath nj], 12j |. we identify 
three domains in each molecule and approximate these by ellipsoids. We then define the 
structures of the two phases. We develop a simple formula that produces these structures 
(|57)1 . and applies to any any model of the molecule, however complex. The simplicity of 
this formula arises from the fact that, even though the molecule itself may be complex, 
the relationship between different molecules is very simple. Based on the experimentally 
observed mode of deformation, we adopt a constrained theory for the sheath. The con- 
straints are exactly satisfied by both extended and contracted sheath. The resulting theory 
has surprising implications with regard to the response of the sheath to different loadings, 
including a strong first-order Poynting effect, unexpected relations between moduli, certain 
combinations of applied axial force and moment that do no work on the sheath, and a cer- 
tain relation between the force and moment needed to transform contracted to extended 
sheath. Among these results, there are numerous possible points of comparison with future 
small-scale quantitative experiments. 

A very early model of helical contraction proposed by Pauling provides a basis for 
simplifying our free energy. Pauling envisioned that helices forming a cylinder could be 
compressed to the point where adjacent turns of the helix would form bonds. This leads 
to a simple model of interactions with one family of bonds guiding assembly and another 
causing contraction. As higher resolution images of T4 tail sheath become available 
this remains likely the principal mechanism for sheath contraction. Guided by Pauling's 
mechanism, we build a simplified free energy for T4 tail sheath. 

The deformation of T4 tail sheath is a particularly interesting case for the theory for 



several reasons: 1) very large changes of shape in an organized protein structure take place 
as a means of producing force, 2) the shape change has been identified as of martensitic 
type, which suggests a multi-well elasticity theory, and 3) there is an interest in understand- 
ing how this phase transformation relates to nonbiological martensitic transformations, and 
in particular how the force and energy of contraction compare in the biological and non- 
biological cases. The latter could suggest strategies for man-made analogs of the T4 tail 
sheath. Finally, 4) a quantitative evaluation of the energy stored has interesting biological 
significance. That is, in bacteriophage T4, as in all viruses, there is no mechanism for the 
production of energy. Thus, all the energy that is released upon contraction of the tail sheath 
must be stored during the assembly phase of the virus, from apparently high free energy 
molecules created during translation of the viral genome, aided by the energy consuming 
translation mechanism of the bacterial host. For the purpose of storage of this energy, the 
process of epitaxial stabilization, familiar from the growth of semiconductor compounds on 
single crystal substrates, apparently plays an important role. Motivated by these ideas we 
suggest such a program of epitaxy (Section 10). 

Mathematical Notation: bold faced uppercase letters are 3x3 matrices and bold faced 
lowercase letters are vectors in M 3 . Components are relative to a fixed ortho normal basis 
throughout. The summation convention is used, A B is the inner product between matrices: 
A • B = AyBij, and || A ||= V A T A. The superscript T denotes transpose, trA = An 
is the trace of A, I is the identity matrix, (a ® b) is the matrix with components afij, 
SkewA = (1/2)(A — A T ), and 3x3 rotation matrices are denoted by SO(3) = {R : R T R = 

I, det R = +1}. denotes all pairs of integers Sijk is the permutation symbol, defined 
by Sijk = 1 if ijk is an even permutation of 123, e^k = —1 if ijk is an odd permutation of 
123, and = if any index is repeated. 

II. KINEMATIC DESCRIPTION OF INDIVIDUAL PROTEIN MOLECULES, 
CHAINS AND SHEETS 

We are interested in chains and sheets consisting of protein molecules. For simplicity we 
shall consider both structures to consist of identical molecules jsoj. 

A molecule will be specified by a pair (y, R) consisting of a position vector yel 3 and a 
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rotation matrix R G SO (3), termed, respectively, position and orientation. Protein chains 
and sheets are constructed by building up one and two dimensional arrays of these molecules. 
For chains we choose a set of integers {1, . . . , N} corresponding to N molecules and assign 
mappings 

y : {1, . . . , N} - M 3 , R : {1, . . . , N} - SO(3), (1) 

For sheets we denote molecules by pairs of integers G Z 2 . We consider a set T> = 

{1, . . . , iV} x {1, . . . , M} and mappings 

y.V^M 3 , R:£>^SO(3), (2) 

For the configuration of a single molecule we use the obvious notation (y iy Rj) for chains and 
(yi,j, Rjj) for sheets. More generally, T> could have the form S]xZ 2 where Q is a domain in 
the plane. 

The usual way to define orientation of a molecule, or a collection of mo 
take moments of the (time averaged) mass distribution (see, e.g., De Gennes, 
2), typically the second or fourth moment. Below, we suggest a different definition of 
orientation that seems to have advantages with regard to the connection with molecular 
dynamics simulation. 

Each molecule consists v atoms of C, H and various other elements in a folded config- 
uration. For each atom we assign a corresponding atomic mass m ; , i = 1, . . . , v. Near 
physiological temperatures, the atoms in a protein molecule undergo rather large vibrations, 
but it is still sensible to talk about time averaged position on macroscopic time scales and we 
shall assign these average positions as yi, . . . , y v . Time averaged momentum has dynamical 
significance, so we use it to define the position of a molecule. We will take its position to be 
its mass averaged position, 



ecules, is to 
3, Chapter 



From the same information we can also obtain a measure of the orientation of a molecule. 
That is, we consider a molecule in standard position defined by fixed atomic positions 
xi,...,x„. This standard position could for example be the collection of positions in a 
crystallized form of the molecule, deduced from x-ray crystallography, or from theoretical 
studies of the configuration of single molecules in solution. From these reference positions 



we define the mass averaged reference position as above, 



(4) 



A natural concept of orientation is obtained through the average deformation gradient of 
the molecule, defined in the following way, 

sr=i m i (y< - y) ® ( x i - x ) 



Here, r can be taken as a typical radius of the reference molecule, e.g. 



(5) 



E^i( x » ~ x ) 2 /gx 

(The position and orientation of a molecule will turn out to be independent of r). The 
expression (jSJ) for F is dimensionless and translation invariant. Typically, it will be true 
that det F > 0, which we assume. If the represent a rigid deformation of the reference 
molecule, i.e., = Qxj + c, i — {1, . . . , i/}, Q G SO (3), then it follows from (jSJ) that 

F = QV. (7) 

where 

-17- ^i=l m i ( X * - jOg (Xj - X) 

r E<=i m * 

The latter is interpretable as a normalized reference moment of inertia. In general we will 
use the rotation in the polar decomposition of F as a measure of orientation. That is, we 
will write 

F = RU where R G SO(3) and U = U T is positive -definite, (9) 

and define R in this decomposition as the orientation. For molecules that deform as well as 
rotate and translate, R is still a natural measure of orientation. 

These definitions of position and orientation, the latter defined by a polar decomposition 
of the apparently complicated formula for F, have several attractive features. First, we 
observe that both F and y are linear functions of the positions yi, . . . y v . We have framed 
the definitions in this way so that their second time derivatives are immediately related to 
(time averaged) forces via the equations of molecular dynamics. Second, there is a useful 



variational characterization of R and y. That is, R(xj — x) + y is the rigid deformation that 
best approximates the mass distribution of the molecule in the least squares sense. More 
precisely, if we consider the rotation matrix Q and vector c that minimize jsjj, 
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c,QeSO(3) 



J |y(z)-[Q(z-x)+c)]| 2 C /m(z), (10) 



where dm is the mass measure of the molecule, i.e., m = 'Y^Tn i 5 Xi , and y(xj) = y,, then 
it follows from the simple quadratic minimization problem (fTU|) that Q = R and c = y as 
defined by (J3J) and The proof of this fact is straightforward. First, by differentiating 

(I10J) with respect to c we conclude immediately that c = y. Then we replace c = y in (I10J) 
and simplify. The minimization over Q G SO (3) then becomes 

min (-tr(Q T F)) = - max tr(Q T F) = - max tr(Q T RU) 

QeSO(3) QeSO(3) QeSO(3) 

= — max tr(QU) = — max > A^ • Qej, (11) 

QeSO(3) QeSO(3) 

where {ei, B2, are orthonormal eigenvectors of U with corresponding positive eigenvalues 
{Ai, A2, A3}. It is immediately seen that the latter maximization problem is uniquely solved 
by Q = I, implying that Q = R. 

The orientation R can be obtained from the formula R = FU _1 = F(\/F T F) _1 , the 
square root being the unique positive-definite square root. For the purpose of a constrained 
molecular dynamic simulation (with given R) it is useful to have a linear constraint. A 
necessary 32] condition that R is related to F by Q is that Skew(RF T ) = 0, that is, 

Skew (R (E^m* (x, - x) ® (y, - y))) = 0, (12) 

which is a linear constraint on the y». 

In summary, our basic kinematics of a molecule is specified by a pair (y, R) defined by 
(jUJ) and (0) , (jHJ) , Q • We wish to emphasize again that this choice of kinematics does not 
entail assumptions of rigidity of molecules. These formulas do allow one to determine our 
kinematic variables from unrestricted first principles calculations. 

We note that there are more complicated theories possible with this general type of 
kinematics. A further possible generalization would be that the free energy is affected by 
position and and full deformation gradient of molecules defined by ©• 



III. FREE ENERGY 



Although all atoms have infinite range, each protein molecule interacts primarily with 
its close neighbors and we shall develop the theory on this basis. Because bonding sites are 
often localized and interlocking, molecules are expected to exert both forces and moments 
on neighboring molecules. 

We base the theory on a formula for the free energy. We make two simplifying assumptions 
which easily could be generalized: 1) the molecules are identical (Thus we can use the 
reference configuration introduced in Section |H] for all molecules), and 2) we only consider 
nearest neighbor interactions. For chains, the "nearest neighbor" of i G {2, ...,N— 1} 
refers to the two molecules % — 1 , i + 1 for interior molecules while it refers to the molecule 
2 for i = 1 and N — 1 for i = N. For sheets there are various possibilities. One can have 
triangular lattices with each molecule bonded to 6 nearest neighbors or rectangular lattices 
with each molecule having 4 nearest neighbors or more complicated situations. In the case 
of 4 nearest neighbors the nearest neighbors of (i, j) consist of all molecules of the form 
{i + 1, j), {i — 1, j), (i, j + 1), (i, j — 1) that lie in V. If not all four of these are in V we call 
(i, j) a boundary molecule; otherwise, we call it an interior molecule. Here we write the free 
energy only in the case of 4 nearest neighbors, the generalizations being automatic. 

Since such protein arrays are of interest in solution, the assumptions are somewhat differ- 
ent than would be appropriate for atoms in a polymeric chain or a crystal. In particular the 
"free energy" will be taken as the free energy of the protein assembly and a fixed volume V 
of the surrounding solution. This is appropriate to the case that V is surrounded by a large 
bath B having fixed temperature and fixed chemical potentials of species in solution. All 
free energies below will depend on the temperature and chemical potentials, but since these 
will be fixed throughout this paper, we will leave these parameters out of the notation. As 
is well-known the presence of the solution profoundly affects the free energy of the protein 
through osmotic effects, but it also affects the form of the free energy. In particular, bound- 
ary molecules may have a free energy that is different from interior molecules, because one 
or more of their bonding sites is unbonded and exposed directly to the solution. 

In this simplest situation we will assume that, for chains, there is a molecular interaction 
free energy which depends on the position and orientation of a pair of molecules ^(a, R, b, Q) 
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defined for a, b e IR 3 and R, Q e SO(3). In order to accommodate the possibility of 
boundary effects, we distinguish the free energy contribution from the interaction of the 
first and second molecules, ^i(sl, R, b, Q) and the next to last and last ones, iPn(&, R, b, Q). 
The total free energy is then, 

N-2 

tf(yi,Ri,..., y N ,R N ) = ^ ip(y h R h y i+1 , R m ) (13) 

i=2 

+ V ; i(yi ) Ri,y2,R2) + ^(yjv-i,R-iv-i,yiv,Rjv) 

The pairwise form of this free energy is justified by the presence of localized bonding sites. 
While it would seem like the molecular free energies and ipN could be quite different from 
each other and from ip, in fact this is not true within the present context. To see this, consider 
a chain {1, . . . , M, M + 1,..., iV} and translate uniformly the molecules {M + 1, . . . , N}. 
Physically, if this translation is large, this should give the sum of the energies of two separate 
chains. If one writes this out for an arbitrary configuration, one finds that, in fact, ipi — 
ipN = tp, so the preceding energy is in fact, 

N-l 

1=1 

The main physical assumption embodied here is that the contribution to the free energy from 
a pair of molecules is unaffected by the positions and orientations of all other molecules in 
the chain, in keeping with the idea that the main free energy changes are due to changes in 
conformation at the bonding site between a pair of molecules. 

For sheets the assumptions are analogous. In this case there are 15 different kinds of 
boundary molecules, depending on which of the four bonds is missing. To simplify the 
notation we let B be the set of boundary molecules and write the total free energy as 

^(yi,i) R<i,i , • • • j yn,m, Rn,m) 

(i,j)ez 2 nv\B 

+ E <v (15) 

where ipfj is a free energy for the boundary molecules. Each ipfj is one of the 15 functions 
describing the free energy of interaction of boundary molecules with dependence on the 
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position and orientation of neighbors that are present. In writing this free energy, we have 
effectively assumed that all bonding sites of between molecules i, j and + l are the same, 
independent of i and j, and the same for sites of the form i,j and i + 1, j. Also, in many 
interesting cases not all of the 15 kinds of boundary molecules are represented. For example, 
in isolated T4 tail sheath as usually pictured, there at only two kinds of boundary molecules. 

The condition of frame-indifference restricts the form of the molecular free energies. We 
note first that, according to the definitions Q and Q, the quantities yi and R4 are trans- 
formed into Ryi + c and RR4 under a superimposed rigid body motion Ryj + c of all 
the atoms. Thus, in the case of chains, the condition of frame-indifference is (for interior 
molecules) 

^(Yi, Ri, y 2 , R 2 ) = ^(Ryi + c, RR 1? Ry 2 + c, RR 2 ), (16) 
which must hold independently for R, R 1; R 2 6 SO(3), and c, y 1; y 2 G M 3 . Making the 
special choice R = Rf and c = — Rfyi, we see that 

V>(yi, Ri, y 2 , R 2 ) = ^(0, 1, Rf(y 2 - yi), RfR 2 ) = <p(t, Q), (17) 

where Q = Rf R 2 is a relative orientation that is unaffected by rigid body rotations and 
t = Rf (y 2 — yi) is a relative translation which is also unaffected by superimposed rigid 
motions. Note that Q is not simply the rotation which maps the orientation of molecule 
2 into that of molecule 1 (or vice versa), and similarly, t is not a simple translation of 2 
into 1. Rather, these quantities behave like strains, and describe the 6 degrees of freedom 
associated with the straining of bond sites caused by changes of orientation and relative 
position of 1 and 2. 

For sheets the restrictions of frame-indifference are similar: the functions ipi and ip2 in 
(JT3J) both satisfy (JT7J). We let <pi ;2 be the corresponding reduced functions defined by (JT7jl 
subscripted by 1,2. 

IV. COMPATIBILITY 

Compatibility concerns the extent to which one can prescribe the quantities describing 
strain, or, more generally, the functions on which the free energy depends after the condition 
of frame-indifference has been imposed. In our case this concerns the extent to which we 
can assign the relative translations and relative orientations. 
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In the case of chains we therefore assign (ti, Qi, . . . , tjv_i, Qjv-i) and ask whether there 
are positions and orientations (yi, Ri, . . . , yjv, Rat) consistent with these in the sense that 
tj = Rf (y%+x — yi) and Qi = Rf Rj+i, i — 1, . . . , N — 1. It is immediately seen that these 
conditions are solvable, and all solutions are related to each other by exact rigid motions of 
the entire molecule. Hence, the problem of compatibility for chains is analogous to the case 
of 1-D rod theories in continuum mechanics: there are no conditions of compatibility and 
the freedom is precisely overall rigid deformations. 

For sheets there are nontrivial restrictions of compatibility, as expected based on con- 
tinuum shell theories. We begin by considering the case of 4 bonding directions. For the 
analysis below a path is a succession of nearest neighbors in T> and a loop is a closed path. 
We assume here that T> is discretely simply connected in the sense that every point is con- 
nected to every other point by a path and any non-self-intersecting closed loop in T> D Z 2 
is the boundary of the union of (closed) unit squares contained in T>. We assign relative 
translations and relative orientations 

(t M ,Q M ), (i,])eV r and (t^Qij), {i,j)eV u , (18) 

where V r,u are the subsets G V such that (i + (resp. + 1)) are also in V (r 
denotes "right" and u denotes "up"). We ask whether there are positions and orientations 
yij, Rij, G V, that satisfy 

— • i,/ - yij) i 

Q i = R/jR,- • • ('../) P'- 

= &i,j(yi,j+i - yij) > 
Q - = R£-Rij+i, (u')e^. (19) 

Immediately we see that there are some restrictions. For example, if Rjj has been deter- 
mined, then by successive application of JH 2 ,4 there are overdetermined equations for, say, 
Ri+ij+i, these being, 

Rj+ij+i = Rjj'+iQjj'+i = Rj,jQi,j'Qij+i 

Ri+ij+i = Rj+ijQi+i,j = Ri,jQi,jQj+ij- (20) 

Equating these, we get 

Qi,jQi,j+l = Qij'Qi+l,j) (21) 
1 e 



or, 

QijQij+iQi+ijQij — I) (22) 

This has the following interpretation: as we go say clockwise around a unit square in 
the lattice Z 2 , the product of the Q's (taken with transpose if the path goes to the left 
or down) is the identity. Two neighboring squares, both traversed clockwise, give such 
identities of the form QiC^Q^Qf = I and QaC^Q-fQs = I which immediately gives 

I = QfQiQ2QjQ3Q4QjQ^ = QfQiC^Q^Cg, that is, QiQ 2 Q 4 Ql^Qf = I; this 

is a similar compatibility condition for the rectangle consisting of the union of the two 
squares. By induction and using the discrete simple connectedness J3] of V, this extends to 
any non-self- intersecting closed loop in T> D Z 2 . 

So far, the argument concerns the solution of the last two equations of (J19)) . For the 
translations, by again traversing a unit square in the clockwise sense, we have from (119(1 1 ,3 
that Rjjtjj + Ft, j_|_itjj_|_i — R,+ ijt i+ i,j — ^H,j^i,j = 0) which, after premultiplication by H i • 
gives 

Qij ^ i j+1 Qi,i^i+lji — 0. (23) 

As above, equations of this form for neighboring squares can be combined to an equation of 
compatibility for a rectangle and then, by iteration, to a non-self-intersecting closed loop. 

By this time it is clear that the pattern of argument is essentially the same as that for 
differentials (i.e., this kind of argument does not really use that the differentials are small, 
if only nearest neighbors interactions are considered). That is, necessary and sufficient 
conditions for (fTS() to be compatible are that the compatibility conditions for unit squares 
in £>, i.e., all equations of the form 

Qj,iQi,j+iQi+ijQjj = I) 
tjj ~\~ Qijtij-j-i Qjjtj+ij tjj = (24) 

are satisfied. The necessity of these conditions has been proved above. The sufficiency 
follows by giving arbitrarily y 00 G M 3 ,Ro,o G SO(3), assuming without loss of generality 
that (0, 0) is in T>. Then, for any other (k, m) G T> \ B, we consider a path from (0, 0) to 
(k,m). Successive application of (fTT?|) determines first H,k,m and then yk, m , and every such 
R, y along this path. By a process of exhaustion, i.e., construct a path which does not 
cross itself or any other path to a point whose values R, y have not been determined from 



a previously determined one, we then determine all values Rjj, Yij- These satisfy all of the 
equations (|19|l. That is, by construction, a point and neighbor (resp., (i, j + 1)) 

are each connected to (0, 0) by a path used in the construction. These paths may coincide 
over some initial length, but, once they depart from each other, they never intersect. Thus, 
by possibly shortening the loop, we can without loss of generality assume the paths form 
a non-self-intersecting loop with a single link removed. Satisfaction of ()19|) then holds as a 
consequence of the compatibility condition for such loops. 

There is clearly also uniqueness of the construction of Rjj, Yij up to the choice of 
Ro,0)yo,0) which, by the frame-indifference of the quantities Q,t is equivalent to unique- 
ness up to overall rigid deformation. 

Suppose now we add additional bonding directions. Since the equations (jUJ) are both 
necessary and sufficient for the existence of the positions and orientations, and these positions 
and orientations are uniquely determined up to overall translation and rotation (which does 
not affect the (t,Q)'s), then all values of (t, Q) corresponding to other bonding directions 
are uniquely determined by (tjj, Qij), (t*j, Q»,j)- One can write formulas for these. For 
example, if (as in T4 sheath) we have the additional bonding directions — (i — l,j + 1), 
then 

U,j = HfjiYi-lj+l — Yi,j) — Qj-l,j(^i-l,j ~~ ti-lj); 

Qij = R^-Rj-ij+i = Qj-i jQi-ij. (25) 

In summary, with additional bonding directions, necessary and sufficient conditions for com- 
patibility are (JHJ), together with the formulas of the type (|23j) that uniquely determine the 
values of (t, Q) for these additional directions in terms of (tjj, Qij), (tjj, Q»j). 



V. HELICAL CONFIGURATIONS 

As shown in Figure 0] below, the tail sheath of bacteriophage T4 is a sheet consisting of 
the union of two families of helices. Helical structures arise often in biology and they also 
have special position within the context of the present theory, as we explain in this and the 
following sections. 

From a purely geometric viewpoint H. R. Crane in 1950 H argued that if two proteins 
have complementary bonding sites and molecules bond at a specific angle, then chains of 
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these molecules are likely to from helices. In nonlinear elasticity of rods and plates, helical 
configurations also arise in a natural way (Chouaieb and Maddocks, Moakher, Maher and 
Maddocks, jj], 0), and in 3-D nonlinear elasticity there are deep connections between the 
existence of helical configurations in equilibrium, invariance and Noether's theorem (Ericksen 
In all of these arguments the frame- indifference of the free energy plays a central role, 
allowing the variables describing strain to be either constant (in rod theory) or else to depend 
on fewer reference coordinates for these special configurations. 

We begin with chains and recall from (J 17)) that the molecular free energy depends on 
relative translation and orientation (t,Q). Motivated by the examples just cited we first 
try to figure out what are all configurations (yi, Ri , . . . , yjv, Rjv) having constant values of 
(t, Q). In the following section we explain the energetic significance of this choice, beyond 
the obvious fact that such configurations have the property that the molecular free energy 
is independent of the molecule. This problem is immediately solved by the considerations 
of compatibility of the preceding section; we have to solve 

Rf(y i+ i-y;) = t, RfRj+i = Q, i = 1,...,N- 1. (26) 

and the general solution is, 

i-1 

K l+1 = R 1 Q i , y i+1 =y 1 +R 1 ^Q J t, i = 1, . . . , JV - 1, (27) 

j=0 

where yi G R 3 ,Ri G SO(3) are arbitrary, and they are also the values of position and 
orientation corresponding to molecule 1. It is clear from ()27|) that the choice yi G R 3 , Ri G 
SO (3) also corresponds to an arbitrary superimposed rigid deformation of the whole array. 

The positions of the molecules described by the equations (J27|) lie on a helix. To see this 
put Ri = I and note that Q = I corresponds to the degenerate case of a molecules spaced 
equally along a line all with the same orientation. So we assume henceforth that Q ^ I. 
Then Q has an axial vector, that is, a vector e G M 3 whose direction is uniquely determined 
such that Qe = e. By suitable choice of the magnitude of e, we can decompose t = t" + 1 -1 , 
t 1 - ■ e = 0, t" || e; then the second of ()27|) becomes 

i-1 

y,;+i =yi+*t" + ^QH ± . (28) 



The last term in (|28j) can be further simplified. To do so, note that Q — I is invertible on 
the plane perpendicular to e and define r by (Q — I)r = t- 1 , r ■ e = 0, so that Qr = r + 1- 1 . 
Now, iterate the latter to get the identity, 

i-X 

QV = r + ^Q J t ± . (29) 
i=o 

Choosing the arbitrary translation yi = r (to put the origin on the axis of the helix) and 
eliminating the sum in (J28|) using (J29j) we have, 

y i+1 = it« + Q i r, (30) 

which, accounting for the conditions r ■ e = and Qe = e, is the equation of a helix. The 
orientations Rj of these helical configurations also vary in a regular way along the helix in 
a manner given by (|27|) and illustrated, for example, in Figure El 

The basic geometric information, like formulas for the pitch and radius of the helix in 
terms of the given information (t, Q), can be read off from the formulas given in the preceding 
paragraph. In particular, if v > 1 is the smallest number such that = I, then the pitch 
is v |tH|. The radius is KQ-TrVl, the inverse taken on the plane perpendicular to e. This 
inverse is given by (Q - I) -1 = ^q 1 ^ (Q - I) T 

Below, we will observe that the relaxed configurations of extended or contracted T4 tail 
can be viewed as a collection of 6 helices, each with 24 molecules. To evaluate the positions 
and orientations of all these molecules from experimental data, it will be useful to understand 
how the orientations of the molecules on a helix can be varied independently from the shape 
of the helix. This is not immediately obvious from the formulas (|2"7j) . but is easy to work 
out. First it is clear geometrically (and it can be shown from the formulas above) that if the 
shape of the helix is given, i.e., all of the y^, then assignment of the orientation of one of the 
molecules on the helix determines the orientations of all the others. Thus there is expected 
to be one free rotation matrix R to define this orientation. Given a helical configuration 
defined by (yi,Ri,Q,t), then all other helical configurations with the same positions are 
given by 

(yi,RiR,R T QR,R T t), RgSO(3). (31) 

To use this formula, one can think of beginning with a helix of the desired shape and then 
choosing R 6 SO (3) so that RiR is the desired orientation of molecule 1 and all the others 
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then follow. 

For sheets there are similar kinds of helical configurations, having the shape of a ribbon 
bent and twisted into a helix. These are discussed below in Sections IVHI and IIX1 



VI. EQUILIBRIUM, FORCES AND MOMENTS 

In this section we work out completely the equilibrium conditions for chains. 

We shall repeatedly have to take a derivative of functions with respect to a rotation 
matrix, so we first briefly explain the meaning of that here. (It is well-known how to deal 
with the fact that rotation matrices lie on the manifold SO (3) and therefore their components 
cannot be varied independently, but we need to explain the notation). Recall that for an 
arbitrary skew matrix W = — W T the series I + sW + |s 2 W 2 + . . . (i.e., the quantity sW 
substituted formally into the exponential series) converges for all s and yields a rotation 
matrix. For a smooth function /(R) defined on SO(3) we extend / smoothly outside SO(3) 
in any way, and consider 



d „,„ „. 1 



df(R) „ m df(R). 



d mi + sw + -s^ ••••)*) .„ i^-wR • w. ( 3 2) 

In 3D W = — W T has an axial vector Wa = w x a, Va G M 3 , so the latter is a linear function 
of w. We use the notation df/dw for this linear function, i.e., 

8 .m R T . w = 9 -^>R- ■ (wx) * «P ■ w. (33) 
dR OR ( J dw ( J 

In rectangular Cartesian components, 

df(R)_ df(R) 

~ £ijk ~dR- Rkm ' (34) 

where is the permutation symbol. 

We suppose that the chain is loaded at its ends by generalized forces fi, f/v conjugate to 
yij Yn and M 1; M^r conjugate to Ri, R^r applied, respectively, to molecules 1 and N. The 
total free energy of the chain and loading devices is then, 

(yi, R x , . . . , y N , R N ) - yi ■ fi - y N ■ f N - R x ■ M x - R^ ■ M N 

N-l 

= V>{U, Qi) - yi • fi - YN- fiv - Ri - Mi - Rat ■ M N . (35) 
i=i 

on 



According to general principles, the first variation of the total free energy with respect to 
rigid translations is the balance of forces and its first variation with respect to rigid rotations 
is the balance of moments. The former, i.e., the derivative of (f33j) with respect to s at s = 
of a variation y"j(s) = y« + sc, for arbitrary choices of c, yields, by the frame- indifference of 
the quantities t, Q, 

fi + f;v = 0, (36) 

indicating, as expected, that fi and f/v are interpretable as simple applied forces. Similarly, 
the first variation y;(s) = (I + sW + . . . )y u R;(s) = (I + sW + . . . )R* with W = -W T 
independent of i yields, using the arbitrariness of W and the formulas 



(yi - Yn) x fjv + mx + m N = 0, (37) 
where (in the notation of (HUH)) 

g(Mi ' gj d(M N ■ R) 

mi = to~> mN = to~> (38) 

evaluated, respectively at R = R 1; Rat. In rectangular Cartesian components, 

mij = —£ijkMi irn Rikm, rn^ j = —EijkMNim Rn km- (39) 

Clearly, (|3T|) is a balance of moments and rri!, m^r are pure moments applied to molecules 1 
and N, these being related by the somewhat nonobvious formulas to the generalized "forces" 
Mi, Mat. 

Having established these interpretations we now take a general first variation of the total 
free energy, yi(s) = y, + sUj, Rj(s) = (I + sWj + . . . )Rj, where now the positions and 
orientations of molecules are varied independently. The previous variations being perfectly 
legitimate, we may as well assume that the generalized forces satisfy ()36|) and ([37)1 . in which 
case we write the total free energy as, 

N-l 

V (Rj(ym " Yi) , R^Ri+i) + (yat - yi) • fi - Ri ■ M x - R^ • Mat. (40) 

i=l 

Inserting these variations, differentiating with respect to s, evaluating at s = and using 

the arbitrariness of Uj, Wj we get after some manipulation, for interior molecules, 

d<p(ti,Qi) 
at 

(y<+i - Yi) x fjv - + uai,i+i = 0, i = 2, . . . , N - 1, (41) 

Ol 



where rn^+i is given compactly by using the notation (jBUjl - fpMj) : 

mtM = R ,^Q£). (42) 

The argument leading to (jH]) 2) is somewhat involved, so we present the details in the 
Appendix. 

For boundary molecules we have 

(y2 - Yn) x fjy + m 1)2 + m N = 0, 

mjv-LW + m N = 0. (43) 

From these equations we interpret m^ +1 as the moment on molecule £ + 1 produced by 
molecule £, and Hed(p(tg, Qt)/dt as the force on molecule £ exerted by molecule £+1, and, 
even though the latter is the force on molecule £, we should interpret this force as acting at 
the center of mass of molecule £ + 1. 

Helical configurations have a special status with regard to the interior equilibrium equa- 
tions. Recall that these configurations are defined by saying that the relative translation 
and orientation are independent of the molecule, and they are characterized by and 
()27|) . Let (t, Q) be the given relative translation and orientation associated with a helical 
configuration. It follows from the first of (}4*Tj) that Rf f/v is independent of i and then, using 
the first of (|2*7|). that Rffjy is on the axis of Q: 

Qc = c, where c = Rffjv- (44) 



From the results of Section |V| RiC is the axis of the helical chain. Thus, (|44|) has the 
interpretation that helical configurations are consistent with axial forces only. One can think 
of this physically in the following way: given an applied force f^v, the overall rotation Ri 
will adjust itself to make the force axial. It then follows that the N — 3 interior equilibrium 
equations (j4*T|) i collapse to the single vector equation 

= c = Rjf N . (45) 

Now we turn to the second equilibrium equation (J41j) o for the moments. Using (J42j) and 
we get that the N — 3 interior equations (jH} 2 collapse to the single vector equation 

txc + (I-Q^=0. (46) 



Note that the c-component of the latter is an identity, so comprises effectively two 
equations which relate the applied force to the projection of the moment on the plane 
perpendicular to c. The Ric-component of (JHJb requires (— R/fmj-x^ + Rfm^+i) ■ c = 
0, % = 1, . . . , N — 1. Therefore from the boundary equation (|43|h the c-component of the 
moments is defined by, 



Mathematically, we can view the equation (}44j) in the following way. Given the applied 
force fjv, the magnitude of c is determined by c = Rffjv- Then, the unknown Q determines 
the direction of c as well as Ri, up to an arbitrary axial rotation. In addition, we have 
freedom to prescribe the applied axial moment R^m^v • c. We then recognize that the 
equilibrium equations (|45p. (|4l)j) and (|4"7j) comprise 6 equations for the determination of 
the 6 unknowns (t, Q). Depending on the convexity properties of ip, these are expected to 
determine (t, Q), but we do not explore this here. 

VII. BACTERIOPHAGE T4 TAIL SHEATH 

In this section we specialize the formulas given above to T4 tail sheath. The first task is to 
describe the sheath in its extended and contracted configurations and identify the positions 
and orientations using experimental measurements. 

T4 tail sheath can be viewed as a protein sheet as defined above. We can think of a 
cylinder oriented vertically. The lowest annulus of the cylinder is a circle of 6 molecules. Each 
of these 6 molecules generates a right handed helical chain consisting of 23 molecules 
Hence, we will identify the molecules accordingly, (y^j, Ri,j), i = 1, . . . , 6, j = 1, ... ,23. 

We work in the usual orthonormal basis (ei, e2, 63) and without loss of generality we will 
choose the overall rotation and translation so that the axes of all the helices coincide with e3 
and the first annulus lies in the e 1; e 2 plane. The first annulus is a circle (i.e., a degenerate 
helical chain). According the results in Section this case corresponds to the case where 
t = to is perpendicular to the axis of Q and a little calculation shows that without loss of 




(47) 



generality (by suitably rotating the six molecules about the e3 axis) we can assume, 



' cos6 -sinO 0^ 



t = (-p,0,0), Q e 



Miifl cosfl , (48) 

V V 

p > 0, and the fact that the 6 molecules are equally spaced on the helix gives Qg = I ==>- 
6 = tt/3. Without loss of generality we write yi = p(l/2, v3/2, 0). The radius of the circle 
of positions is p. 

Emanating from each of these six molecules is a helical chain whose first molecule has 
now a given position and orientation. According to results of Section we need to specify 
(t, Q) for each of these chains. In fact all of these chains have the same (t, Q) because 
suitable rigid rotations and translations bring them into coincidence with each other: the 
whole configuration of the tail sheath has 6-fold symmetry. The axis of Q is again e3 so 
Q has the form (jlEJta, Q = Q7- Thus, besides the radius p of the cylinder, we need to 
determine the four parameters 

7 and t = (n, r 2 , A). (49) 

For this purpose we first show that t±,T2 are determined by p and 7. Referring to Section 
EJand using that the initial point is yi = p(l/2, v3/2, 0), we have from the equations 
t = t" + t- 1 , (Q — I)r = t- 1 , r = yi that (Q — I)y x = t- 1 , from which r 1; r 2 are given by 



P 

Tl= 2 



^(cos7 — 1) — v^3sin7^ , r 2 = ^ ^sin7 + ^(0057 — l)j (50) 



in terms of p and 7. It remains to determine p, A, and 7. The values of these depend on 
whether we consider extended or contracted sheath. 



A. Extended tail sheath 

Extended sheath has an interesting geometric property that we term the 8/3 rule. The 
rule is that the 8 th molecule along one of these helices, beginning at a molecule on the 
first annulus, lies directly over the third molecule away counterclockwise along the annulus, 
Figure El (The justification of this rule from measured data of Leiman et al. ^l] is given 
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FIG. 2: Illustration of the 8/3 rule of extended sheath. Left: the first 8 molecules (i = 1, j = 
1,...,8) on the main helix, viewed down the axis of the cylinder. Right: the first 3 molecules 
{i = 1,2,3, j = 1) on the first annulus, again viewed down the axis. The slight touching of 
domains of neighboring molecules on the right picture is an artifact of the ellipsoidal approximation; 
in reality these do not touch. 

at the end of this section.) Specifically, in our notation, 

yi,s ■ ei = y 3 ,i ■ ei, yi, 8 ■ e 2 = y 3 ,i ■ e 2 , (51) 

This statement and the helical structure of the tail imply full periodicity, y(ij+7) • &i,2 = 
yu+2,j) ' e i,2 whenever these are defined. With regard to the present theory, all the good 
properties of helical configurations discussed above (and below) would hold without this 
"accidental" periodicity. This suggests that its presence is perhaps related to something 
other than the function of the tail, possibly its self-assembly via annulus-by- annulus epitaxial 
growth. In this regard, if one omits the last annulus, then the rest of the tail is exactly 1 
period. In other words, without omissions, the 22 rd annulus lies directly over the 1 st annulus. 
It will be seen below that this 8/3 rule also applies to the orientation, R(i,s) — R(3.i)- This 
is of course the smallest period exhibited by the tail sheath. Possibly these facts are related 
to process by which the tail tube directs the growth of the tail sheath, which is assembled 
in the extended form (Below, the contracted form will not have this or a shorter period). 

The two equations (joTj) give apparently two restrictions on the remaining parameters 

OK 



Extended Centers of Mass 

mass xi X2 X3 radius 

Domain 1 1679.22 -12.40 -64.14 42.32 65.32 

Domain 2 1673.35 29.45 -70.08 49.13 76.02 

Domain 3 751.18 49.25 -93.90 42.32 106.03 

Total 4103.75 15.95 -72.01 45.10 73.75 



Contracted Centers of Mass 

mass x-| x 2 x 3 radius 
Domain 1 1683.34 -51.10 -63.54 45.71 81.53 

Domain 2 2128.82 -46.08 -105.65 35.54 115.26 

Domain 3 2408.94 -64.51 -126.87 40.94 142.33 

Total 6221.10 -54.57 -102.47 40.38 116.10 



FIG. 3: Domain coordinates used to determine orientation, R4 1. The circle denotes the centerline 
of the cylindrical sheath and each molecule is modeled by three domains (The three domains to 
the right represents a molecule of extended sheath). Total mass and center of mass of each domain 
are shown to the right. 

(p, A, 7). Written out using (|27j). these two conditions are 

6 

- Q't + to + Qn/sto || e 3 . (52) 

3=0 

In fact this condition only involves 7 and is equivalent to the pair of equations 

2 + cos 77 — sin 77 = 0, V3 cos 77 + sin 77 = 0. (53) 

These equations have simultaneous roots at 7 = 2tt/21 + 2rr n/7 where n is an integer. The 
root of interest (i.e., corresponding to a fraction of a turn in the counter clockwise sense) is 
7 = 2rr/21. From the form of t it can now be seen that 21 A is the pitch of the helices. 

It remains to prescribe the orientations of all the molecules. As explained in the few lines 
preceding (J3*T)) this is assignable independently of the positions. Since we have put Ri = I, 
we may give this by giving the orientation of molecule (1, 1), from which all the orientations 
of all the molecules are determined. In summary, the following information is needed from 
experiment for extended and contracted tail sheath: 

• The orientation of molecule (1,1) = Ri,i ; 

na 




The radius of the cylinder of centers of masses = p : 
The pitch of the helices = 21 A . 



We obtained these from electron density maps of Leiman et al. [12j (We are grateful to 
Petr Leiman for the prepublication data on extended sheath, without which the present 
theory would be incomplete). See Appendix B for how this data was used to represent the 
molecules. Briefly, this data does not give atom positions, but gives an excellent picture of 
relatively rigid collections of atoms called domains. Both extended and contracted sheath 
consist of three such domains. We assumed charge neutrality and computed centers of mass 
of domains, then used the formulas ©, ©, and (jHJ) to compute the position and orientation. 
Three issues should be noted: 1) with three domains F is singular with rank equal to two; 
nevertheless, R is uniquely determined by 0- 2) This data was rotated about the axis 
of the helix and translated into the position of molecule (1,1) ). This gives, p = 73.75 A, 
A = 40.6 A. We chose extended sheath to be the reference configuration so that R^i = I. For 
contracted sheath Ri i is given by (|5fij) below. 3) Note from Figure Elthat masses of domains 
are not conserved. This is a consequence of the flexibility of certain bonds, which causes 
some mass to be lost by the averaging procedure inherent in any 3D reconstruction. To give 
definite results we ignored this problem and used the measured masses of each domain. 

B. Contracted tail sheath 

For contracted tail sheath the evaluation is completely analogous to the above except 
that the 8/3 rule is replaced by a 12/1 rule, 

yi,i2 • ei = yi,i • e x , y 1>12 • e 2 = yi,i • e 2 . (54) 

As above this leads to a pair of equations for 7, 

1 - cos(ll7) + V^sin(ll7) = 0, V3 - V3cos(ll7) - sin(ll 7 ) = 0, (55) 

having simultaneous first positive root at 7 = 27r/ll. As above, to complete the description, 
we need the orientation of the first molecule Ri,i, the radius of the cylinder p, and the pitch 
of the helices, which in this case is 11 A. Using the electron density maps of Leiman |ljj in 
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FIG. 4: Pictures of extended and contracted tail sheath based on the formula (|57jl. using the 
method of visualization described in Section [VII Bl 

the same manner as above, we get for contracted sheath p = 116.1 A, A = 16.4 A, and 



R-i, 



V 



0.426 0.4388 -0.791 
-0.4378 0.8653 0.244 
0.7916 0.242 0.561 



\ 



/ 



(56) 



In summary, the configuration of extended or contracted tail sheath is given by the 
following equations: 



i-2 



i-2 



fc=0 fc=0 

z = l,...,6, j = l,...,23, 



(57) 



where 7 = 27r/21 for extended and 7 = 27r/ll for contracted tail sheath. Here, Qe,t are 
defined by @g), yi = p(l/2, 0), t = (-p,0,0), t = Ae 3 + (Q 7 - I)yi and we use 

the convention Q° = I (Also, sums of the form Xlfclo where m < are simply put equal 
to zero). Pictures of extended and contracted tail sheath obtained from formulas (J57)) with 




FIG. 5: The main helix of T4 tail sheath in extended and contracted forms, illustrating the screw 
action. 

the data given above are shown in Figure 0] The method of visualization is to approximate 
the domains of the molecules of extended and contracted sheath by ellipsoids, centered at 
the centers of mass of the domains, as described in detail in the Appendix B. As can be 
seen there, this is quite an accurate representation of the molecule. Then we applied the 
formulas ()57|) to this collection |35j ]. 




There is a substantial screw action that occurs when the sheath fully contracts. This can 
be seen from Figure El which shows the corresponding main right handed helices in extended 
and contracted sheath. If the baseplate is held fixed during contraction, the neck experiences 
almost a full turn, the angle change being about 343 deg rees. 

We should add that the data of Leiman et al. also provides a direct measure of 

the validity of the 8/3 and 12/1 rules, which we have used above to evaluate 7 = 27r/21 = 
17.14° and 7 = 27r/ll = 32.73°, respectively. The direct measurement of Leiman et al. gives 
the very nearby values 7 = 17.2°, 32.9°. 
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FIG. 6: Height of the tail sheath vs. end angle (measured from extended sheath) according to 
the constraint, showing a strong first-order Poynting effect. Dots correspond to extended and 
contracted sheath. 

VIII. A SIMPLE CONSTRAINED THEORY FOR BACTERIOPHAGE T4 TAIL 
SHEATH 

A. Constraints 

Our general expression for the free energy ()15j) of a protein sheet can be quite complicated, 
and in our case is made more complicated by the presence of additional bonding directions, 
as we explain below. In this section we use the known configurations of the sheath to make 
simplifying assumptions that allow us to arrive at a manageable form of the energy. 

First we note that each molecule in the sheath undergoes a substantial motion. Never- 
theless, there are some simplifying features of this motion. These features are remarkably 



close to the ideas of Pauling 



liym 

3, 



who, prior to any knowledge of T4 tail sheath, theorized 



that arrays of the helices of Crane could contract by ha ving adjacent turns of the helix 



form bonds. Later, in his study of T4 tail sheath, Moody 



1511 observed that bonds on the 



right-handed helix remained to some extent invariant during contraction. He noticed that, 



qo 



on the main helix, while there is a substantial relative rotation of molecules, the distance 



between neighbors does not change too much. This concerns |36J] neighboring molecules of 
the form + 1), shown e.g. for i — 1 in Figured According to our equation (JH7|) we 

have for both extended and contracted sheath that this distance is |t| = |R^-(y i)3 - +1 — yij)\; 
it is of course independent of both i and j and is given by 

|t| 2 = A 2 + 2p 2 (l-cos7). (58) 

When this is evaluated for extended and contracted sheath using the data above we get, 
respectively, |t| = 46.2, 67.4 A. While these are fairly close, Moody noticed that if, instead of 
using the separate radii of extended and contracted sheath, one uses in both cases an effective 



radius of p = p c s = 77. 6A, then ^A 2 + 2p 2 ff (l — cos 7) = 46.7 A for both contracted and 
extended sheath. The reason for the smaller-than-average effective radius presumably relates 
to the relative importance of the bonding of inner domains, which appear to be in contact 
in EM cross-sections of the sheath at a radius near p e g. 

We remark that if we approximate cos 7 by 1 — (1/2) 7 2 in the expression A 2 + 2p 2 ff (l — 
cos 7), and also adjust the value of p c s slightly to p c s = 76.33 A , then we have the following 
simple quadratic condition: 

„ „ „ I 2170 A 2 for extended sheath, 
A 2 + 1 2 Pin ={ „ 2 (59) 

I 2170 A for contracted sheath. 

In view of its physical interpretation, we assume (J59j) represents a special stiffness in T4 tail 
sheath and we adopt it as a constraint for all values of A and 7. Below we generalize it to 
distorted configurations. 

The constraint (J59|) has an interesting consequence. To describe this, we first recall that 
according to macroscopic nonlinear elasticity, a uniformly twisted cylinder subject to zero 
axial force and free sides changes its diameter and also its length. The latter is referred to as 
the Poynting effect. It is generically a second order effect: the elongation goes as the square 
of the angle of twist of the cylinder; the elongation can be either positive or negative and 
it is typically positive (lengthening) for elastomeric materials. For uniform states of T4 tail 
sheath, that is, states given by the formula (JH7j) subject to the constraint . we have a 
very strong first-order Poynting effect. That's because, by Ij57j) . the the end angle measured 
from the extended configuration is 227 while the height is 22A. 




FIG. 7: Left pair: the first annulus (i = 1, . . . , 6, j = 1) of extended and contracted sheath viewed 
down the axis of the cylinder, with the axis of the rotation of molecule 1 shown passing through 
its center of mass. Right pair: the first three molecules on the main helix (i = 1, j = 1,2,3) of 
extended and contracted sheath viewed parallel to the axis of rotation (shown as the black dot) 

The predicted height vs. twist relation is shown in Figure |H1 This is essentially a plot 
of the constraint ()59j) . We note that if the approximation cos 7 ~ 1 — (l/2)7 2 is not made, 
then, on the scale of Figure El the resulting graph is indistinguishable from Figure El Note 
the dramatic Poynting effect, particularly at contracted sheath. It would be interesting to 
look at this relationship experimentally. 

There is another simplifying feature of the deformation of T4 tail sheath that concerns 
the orientation. For molecule (1, 1) the rotation that maps extended to contracted sheath is 
given in (JoT)|) . and its axis is given by 

(0.001,0.875,0.485). (60) 

The angle of rotation is close to 64.8°. Remarkably, the axis of rotation (JrJU|) is within about 
1° of (0, y/3/2, 1/2). A possible reason for this rotation and its implications become clear 
when we superimpose the rotation axis (black line) on pictures of molecule (1, 1) of extended 
and contracted tail sheath, Figure From these pictures, if one thinks of the molecules 
as having the shape of a kind of twisted banana, then evidently the axis of rotation passes 
through its axis. Thus, the rotation of molecules of tail sheath seems largely constrained by 
steric hindrance. But there is another feature of this rotation that is suggested by the two 
pictures on the right of Figure In these two pictures we are looking directly down the axis 
of rotation. One can see that the rotation of (1,1) of about 60° is causing it to align itself 
approximately with the main helix. As above, this is consistent with the idea that there are 



strong bonds linking molecules on this helix that not only constrain lengths but also relative 
rotations. In fact, even though the molecules depicted at the right of Figure [7| do not touch, 
there are strong bonds that link the innermost domains. 

We now develop this idea quantitatively. To account for the evidence for steric hindrance, 
we assume that the orientation R x i has the fixed axis which we take to be (0, y/3/2, 1/2), 
but we allow the angle of rotation to be free for the moment, i.e., 



Ri,i(0) 



( cos9 ±sin# -^sin# ^ 

-isinfl f + icosfl ^-^cos# 



(61) 



^sin0 ^-^costf i + fcos# J 

Guided by the pictures on the right of Figure and the motivation above, we compute 
R-i\(0)(yi,2 — yi,i) f° r extended and contracted sheath using (|6Tj) and the corresponding 
measured values of 9 = 0, 64.8° respectively. The two vectors obtained are fairly close to each 
other as is expected based on Figure[7| However, this computation reveals that the projection 
of these two vectors on the 1 axis is exceptionally close. That is, ei • Rf 1 (6 , )(yi i2 — yi,i) 
has nearly the same value of —21.2 A for extended and contracted tail sheath. We again 
hypothesize that this represents a special stiffness in this system, and we adopt it as a 
constraint, ei-R^ 1 (6')(yi i 2— yi,i) = —21.2 A. Written out using (JfiTj) and (fKTj) . this constraint 
is. 

2p cos 9 (l + V3 sin 7 - cos 7 ) + sin 9 (p sin 7 + pVs cos 7 - Vs (2A + pj) = 84.8 A. (62) 

It is natural to use this constraint to solve for 9, effectively making the orientation of each 
molecule slave to the variables that describe the spatial positions of the sheath. This is always 
possible for a wide range of reasonable values of 7, p, A satisfying the earlier constraint (JoT?|) . 
Some care has to be exercised with uniqueness, since generically ()62|) has a pair of solutions 
9; however, only one of these lies in a modestly expanded interval containing [0, 64.8°]. 

For uniform states, i.e., configurations obtainable using the formula (jf>7)) . the constraints 
(JSnj) and (p^jl reduce the energy to a function of the kinematic variables p and 7, effectively, 
radius and twist. It would be natural now to write the energy as a double- well energy in 
p, 7, with wells appropriate to contracted and extended sheath. However, it is advantageous 
to consider also distorted states, so that process of transformation can be described. 



B. Nonuniform states 



To describe nonuniform states, we first notice that our basic formula (JH7|) is still useful. In 
fact, this formula can be used to describe an arbitrarily distorted sheath, by simply allowing 
p, 7,A,Ri ; i to depend on To see this, we notice that if the molecule occupies a 

certain position and has a certain orientation, then one can always find a helical cylinder 
with molecule in the given position and with the given orientation. Effectively, the 
formula (|57J) with variables p, 7, A defines certain helical coordinate system based on the 
structure of T4 sheath. We note that this generalization changes somewhat the geometric 
interpretations given above of the variables p, 7, A, Ri,i. 

In a setting of this generality, one could make a reasonable extrapolation of what should be 
the constraints, based on the stiffnesses of the main helix discussed above, but the resulting 
276 degrees of freedom would still be rather large; once the energy of T4 sheath becomes 
known quantitatively, it will then be worthwhile doing something like this, since general 
configurations and forces could be then computed using standard nonlinear optimization 
techniques. For the present, we make a 1-D ansatz that positions and orientations are the 
same on each annulus, that is, 

(jij, Kij) is given by (J3ZJ) with p = pj, 7 = j j} A = Xj, K hl = Rj, 

z = l,...,6, j = l,...,23. (63) 

Our first goal is to reformulate the constraints in terms of these variables. We begin with 
the first constraint (JSHJ). If we calculate |t| = |R^(y ij+ i — yij)\ using (jHSJ) we see that 
it depends in a somewhat complicated way on j, but we can also see from the expressions 
that there is a natural change of variables that restores the simplicity of the expressions for 
uniform states. That change of variables is: 

lj = J (7j+i - li) + 7i, \ = - \) + \ J = 1, • • • , 22 - ( 64 ) 

The inverse mapping is simple averaging: 

J j = -^j(li + ---+J j -i), = t— y (Ai + • • • + Vi) > j = 2,..., 23. (65) 
Note that for uniform states, jj = 7j = 7 and Aj = Aj = A. When the expression |t| 2 = 

QA 



|Rij(yjj+i — y«,i)| 2 i s evaluated for 1-D states in these new variables, it becomes, 

Pj - 2 PjPj+i cos Ij + Pj+i + ( 66 ) 

and the connection with (|58p is immediately clear. In fact, it is expected based on the 
definition of 7j that the approximation cos 7-,- ~ 1 — (l/2)7j 2 is still reasonable and then (jHEJ) 
becomes (pj+i — Pj) 2 + Pj+iPjl] + ■ Comparing with (JoTJj) . it is natural to again replace all 
the pj by the effective radius p e fr- We therefore adopt in the nonuniform case the constraint 

A? + 7|p c 2 ff = 2170A 2 , (67) 

where p e g = 76.33 A. 

Now we generalize the constraints (J61j) . (J62j) on the orientation. First, we recall that our 
way of writing the formula (J57|) automatically adjusts the orientation of each molecule on 
the sheath in a consistent way (preserving the helices) in response to a change of Ri 1. Since 
we assumed above that R11 has the same axis for extended and contracted sheath, then we 
assume this remains true for nonuniform states and R11 continues to have the form (|61|) 
with 9 replaced by 9j. 

Once again, the change of variables (J64|) proves to be extremely useful, for if we now 
calculate the quantity ei ■ Rj^y^+i - y itj ) = e 1 ■ Rj(y iJ+1 - y itj ), we get, 

- ^—2 cos^- [pj + p 3+ i\/3sin7,- - p j+1 cosj^ 

- sm9j (pj + i sin7j + p j+ iv / 3cos7 i - Vs(2\j + Pj))) , (68) 

with the obvious relation to ()62|) . We therefore adopt the following constraint on orientation 
in the nonuniform case: 

2 cos 9j (jjj + Pj + i Vs sin 7j — Pj+i cos 7j) 

+ sin 9j (^Pj+i sin jj + pj +1 V3 cos jj — v / 3(2A j + pjf) = 84.8 A. (69) 

We again view this as a way to determine 9j, j = 1, . . . , 22, making the orientation slave to 
the other variables. 

In summary, there is a natural expression of the constraints within the context of the 
1-D ansatz, this being ()67|1 -(|69 )1 : no internal contradictions arise, and there is freedom to 



FIG. 8: Deformations of tail sheath satisfying the constraints and exhibiting transformation. See 
text. 

make a variety of distorted states that interpolate contracted and extended sheath. The 
unconstrained kinematic variables can be taken to be local twist and radius, which for 
distorted states turn out to be 71, ... , 723 and pi, . . . , P23. If these variables are subject to a 
simple interpolation between extended and contracted sheath, by defining 

7i = /i(j)(2vr/21) + (l-Mj))(2vr/H), 

Pj = fi(j) 73.75 A + (1 - fi(j)) 116.1 A, (70) 

where, for example, is a simple "tanh" transition layer, p(s) = |(tanh((s ~jo)/w) + 1), 
then one can exhibit a contracting sheath as is shown in Figure El These pictures are 
produced in this way, using the constraints (ffTTj) and (frTHj) to determine the Xj and 9j and 
then placing all in the formula (JoT|) . as directed by (Jrj3~j). All three of these pictures have 
the same interfacial width w = 1.5 and interfacial positions jo = 4, 12, 20, respectively. 
These are not necessarily equilibrium states, as the computation of these would depend on 
a quantitative knowledge of the energy function, which we do not yet know. 




FIG. 9: Nucleation of the phase transformation in T4 sheath as viewed from below. Left: extended 
sheath. Right: view of sheath with the first annulus fully transformed, as in the leftmost picture 
of Figure El 

These pictures are interesting from the point of view of nucleation. One of the important 
issues (raised in is that T4 tail sheath is at a scale that would seem to suppress 

martensitic phase transformation. Briefly, the argument is the following: in order to have 
phase transformation with a distortion one expects an interface to pass through the body 
having a transition layer between phases. But because of the scaling between bulk and 
interfacial energy, the interfacial energy should dominate at sufficiently small scales, and, 
therefore, in a sufficiently small body, one would necessarily pay more free energy for the 
transition layer than the lowering of free energy due to the presence of the new phase. In 
the present case "interfacial" and "bulk" energies are better thought of as line and surface 
energies, but the argument is similar. Thus, nucleation is expected to be an important 
issue for phase transitions at small scales, and this is particularly true in the present case in 
view of the enormous transformation strain of T4 tail sheath. It is known from the work of 
Moody that transformation begins at the baseplate. The distortion of the first annulus upon 
nucleation can be seen in Figure El An alternative view is seen in Figure which shows a 
view from below; in this figure the lowest annulus is nearly fully transformed (jo = 4) on 
the left while the corresponding untransformed sheath is shown on the right. 

Finally, a brief remark about constraints and frame- indifference. It is well-known that 

on 



internal constraints in mechanical systems should be frame- indifferent, and this may not 
be obvious in the present case. In (|57|) there is some freedom of how one assigns a change 
of frame, either attributing this to changes of 7, p, A, R11 or, for example, to changes of 
Q 7 (at constant 7), Q n /3, t , t, y 1; R11. The latter is preferred, and also preserves the 
1-D ansatz. The precise form of a change of frame y — > Ry + c, R 6 SO (3) is then 
Q 7 — > RQ^R r , Qtt/3 — ► RQ7r/3R r 7 to — > Rtoj t — > Rt, yi — > Ryi + c, Ri f i — > RR11. With 
this understanding, 7,p, A are objective scalars and the constraints are frame- indifferent. 



C. Free energy 

Having reduced the complexity of the energy by formulating constraints for a 1-D ansatz, 
we are now in the position to suggest a relatively simple form of the energy function for 
nonuniform states of the form (jHSj)- We take the independent variables to be 71, ... , 723 and 
Pi) • • • )P23- m the extended T4 tail sheath the main helix is the only direction of strong 
bondin g; h owever, in contracted sheath there are three bonding directions, as identified by 
Moodv [l^ and Leiman et al. jl2 |. These are the bonds — + 1), — (i — 1, j + 
1), — (i — l,j + 2). For this bonding the free energy is a minor generalization of ()15j) : 

^(yi,l, Rl,l ; • • • ; y6,23, R6,23) 

= V'iCyi.j) ^4,37 y«,j+i) R-ij+i) + ^(yij, Ri,j, yi-i,j+i; Rj-ij+i) 

ie{i,...,6}, je{l,...,23} 

+ ^(y^j, R«j, yi-ij+2, Ri-ij+2). (71) 

Here we have omitted separate consideration of boundary molecules; to account for molecules 
beyond the boundaries, we do a suitable periodic extension. Recall that the ipi,^, i>3 depend 
on certain objective quantities, the t's and Q's, cf., (fTTJj) . 

We assume the 1-D ansatz (J53~j) and the constraints (joTj) . (|55|). If we write out all 

of the frame-indifferent expressions appearing in the arguments of ^1 in the sum ([71)1 . we 
have 

R ?i R i,i+i = /(Pi> Pi+i> 7i) R fi(yij+i - y»j) = g{pj, Pj+u ij), ( 72 ) 

where / and g are somewhat complicated algebraic vector-valued functions. We recall that 
this bond (along the main helix) guides the assembly of the extended state and is preserved 
throughout contraction. Since this bond is relaxed in the extended state and undergoes 
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relatively small deformations, one simple way to model it is as a harmonic function centered 
at the extended state: 



}+k 3 (p j+1 - Pj y (73) 



where k and k{ > are constants, p e and 7 e are the values measured for the extended sheath. 
The term containing k 3 is suggested by the presence of pj,pj + i and the expectation that 
this energy is minimized by the uniform state: this term is somewhat like the terms of the 
energy of a liquid crystal. 

The bond (i, j), (i — 1, j + 1) spans between adjacent main helices. This bond is largely 
non-existent in the extended sheath and its formation drives the contraction. However, the 
energy ip 2 for this bond depends on the same set of variables pj, Pj+i, as for (i, j), (i, j + 
The radius and pitch of adjacent turns of the main helix provides a measure of the second 
bond's state. If the adjacent turns are close to the contracted state then the bond is formed. 
For configurations where the helices are far apart the bond is essentially broken. And for 
configurations where the helices become very close there is a strong repulsion. Consistent 
with this we propose the potential 

1 



^(jij, R-ij, yi-ij+i, R-i-ij+i 

where 

£(A) = 



12 



(1 - k 4 ( Pj - p c ) 2 - k 5 (p j+1 - Pj f) CfXj), (74) 



(75) 



-a(c - A) 2 (c - 3A C + 2A), A < c, 
0, A > c. 

is similar in shape to a Lennard- Jones potential, except that it has a cut-off at c, where 
C and its first derivative vanish (It is continuously differentiable). This part of the energy 
depends on the parameters p c , A c , c, k^ k$, a, which have the following interpretations. For 
A > c the energy contribution to ip 2 vanishes (i.e., the bond is broken). The values p c , A c 
are the minimizing values of p, A for ip 2 ] in practice, we adjust these so that the measured 
values p c , A c are absolute minimizers of the total energy. The value a is the bond dissociation 
energy; k 4 controls the stiffness of this bond with respect to changes of radius, and k$ favors 
uniformity. The term containing /c 5 multiples C so that the tendency toward uniformity is 
not in force when the bond is broken. The third bond (i, j), (i — 1, j + 2) is similar to the 



second, in that it forms upon contraction. It spans two helices, so the third bond energy 
depends on the pitch and radius of the second nearest helix, and it involves the larger set 
of variables pj, Pj+i, pj+2,Jj,Jj+i- We take it to have a simple form similar to that of the 
second bond, 

= J2 (* ~ k ^ Pj ~ ~ p< ^ ~ k ^ +1 ~~ p ^ 2 ~ ^(Pi+2 ~ Pi) 2 ~ fc 7(A i+ i - \jf) C(Xj), 

where £ is as in (|75)l. In principle, all of the parameters p c , A c , c, k^, k 5 , a are likely to differ 
for bonds 2 and 3, but we do not alter the notation to reflect that. 

So, in summary, for the constrained sheet subject to the 1-D ansatz, we write the total 
free energy 

*(yi,i, , • • • , ye,23, R-6,23) = ^2 ^(PjiPj+ii Pi+^ h> •VfO ( 76 ) 

je{i,...,23} 

where the energy per annulus if) = 6 (^1+^2+^3) an d h > 0,i = 1, . . . ,7, a > 0, kik 2 — k 2 > 
0, A c < c < A e . Note that, because of presence of the cut-off, the values p e , A e are always 
relative minimizers of the energy if (as we assume) the stiffness matrix in (|73*jl is positive- 
definite. 

This energy favors uniform configurations for a suitably restricted domain and for ranges 
of the parameters expected to be physically interesting. Consider the domain (Xj,Pj) where 
£ < and the prefactor of £ is positive. Then a lower bound for the energy on this domain is 
obtained by putting ks = k§ = k$ = and this bound is achieved by a uniform configuration 
that minimizes each term (The individual terms of the sum are minimized at the same 
uniform state). We use the notation 

<f>{p,X) =ip(p,p,p,X,X) (77) 

for the energy per annulus of uniform states. 

A simple explicit energy that uses all of the measured data that we have available, but 
otherwise makes somewhat arbitrary choices of constants, and has a relative minimizer at 
the extended state and an absolute minimizer at the contracted state, is obtained by putting 
fci = 0.333 zeal A" 2 , k 2 = 3.0 zeal A" 2 , k = 0, k A = 10~ 4 A" 2 , a = 0.3719 zeal A" 3 , c = 
30 A, A c = 13.3901 A, p c = 161.398 A, and then by evaluating at a uniform state (1 zeal 
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FIG. 10: Special energy for tail sheath. 

= 1CT 24 kcal). This gives the double- well energy pictured in Figure E3 Here the choice 
of a reflects the calorimetric measurement of Arisaka, Engel and Klump that gives 
4>(p e , A e ) — <P(Pc, A c ) = 60 zcal/annulus, based on arguments described at the end of this 
subsection. 

The tendency toward uniform states plays an important role during self-assembly of 
tail sheath. During assembly, the baseplate forces the first annulus to have the extended 
radius, p\ = p e . As subsequent annuli are added they do so as to match the radius of 
the annulus below. The second and third bonding directions remain incomplete, since the 
formation of these bonds would require the annuli to adopt the contracted radius. Proper 
assembly is accomplished by design; the penalty for mismatching a neighboring annulus 
outweighs the energy the could be liberated by forming the additional bonds. Our energy 
given above has the flexibility to model this behavior through two features: 1) the state 
(p e , A e ) is a minimizer of energy with respect to all uniform small perturbations of (p, A), 
and 2) the terms involving k^, k^, kg, can be tuned so that the addition of a new layer 
onto the growing extended sheet would be penalized from being added with (p, A) near the 
contracted values, even though these have lower uniform energy. The complete analysis of 
self-assembly would require a molecule-by-molecule growth mechanism, involving boundary 
energies, but the present energetic considerations are expected to play a role. 

A 1 



For the rest of the paper we use a more general energy than the special form given above, 
but one that retains some of its essential features. That is we assume an energy per annulus 
of the form ip(pj, Pj+x, Pj+2, Xj, Xj+i) (cf., (JZHJ)) having the properties 



1p(Pc, Pc, Pc, A c , A c ) < 1p(pi,P2,P3, Al, A 2 ) 



for all (pi,p 2 ,pn, Ai, A 2 ) 
not equal to (p c , p c , p c , A c , A c 



for all (p 1 ,p 2 ,p 3 , Xi, A 2 ) 
^>(p e ,p e ,p e , A e , A e ) < ip(p 1 ,p2,P3, Ai, A 2 ) <v near, but unequal to, 

(Pe, Pe, Pe, A e , A e ), 

V 

^(PoPoPo A c , A c ) < ^(p e ,p e ,pe,\ e ,X e ). (78) 

For this more general energy we retain the notation 4>(p, X) = ip(p, p, p, A, A), so it follows 
from the above that 

0(p c ,7c) < <KP,7) for all (p, 7) ^ (Pc 7c), 

<Kp e ,7e) < 0(P,7) fo r all (P,7) 7^ (Pe,7e) but near (p e ,7 e ). (79) 

We make one other assumption on the height difference between the energy wells. In 
\v\ Arisaka et al. did calorimetry on T4 tail sheaths with contraction triggered by two 
methods: raising the temperature to 72 C and by increasing the concentration of urea. The 
former gave -44 kcal/mole (of gpl8 molecules) whereas the latter gave -25 kcal/mole (of gpl8 
molecules). We use the former number here as it was considered the more accurate by these 
authors. From the details of the measurement, raising the temperature did not give reversible 
contraction, but rather irreversible contraction, and temperatures higher than 72 C caused 
denaturation of the whole sheath. Thus, one can infer that the free energy of contracted 
sheath is still lower than of extended sheath at 72 C, though not as low as at 25 C. Without 
any additional information and considering that at least spontaneous contraction occurred 
at 72 C, we estimate the height difference between the wells by the following procedure. We 
restore the temperature dependence of <fi and Taylor expand in the temperature, omitting 
the error terms, 

<Kp, a, e 2 ) = <p( P , a, e x ) + mp d ^ 9l \ e 2 - e 1 ). (so) 

We put 61 = 25 C and 6 2 = 72 C, evaluate (J8U|) at (p e , A e ) and (p c , A c ) and subtract, 
estimating 0(p e , A e , 9 2 ) ~ <fi(p c , A c , 2 ). Now, as is common in the interpretation of calori- 

10 



metric measurements of phase transformations, we interpret the temperature times entropy 
difference as the latent heat: 

62{ d<p(p x 0l ) _ d<p( P x e 1 ) ) = 44Q zcal/annulus _ (81) 

do do 

Here we have ignored the temperature dependence of the entropy evaluated at either well 
separately. Combining ()80|) and (|81|) we get that the entropy difference of the two phases is 
1.27 (zcal/K annulus) and that 

0(p e , A e ) — 4>(pc, A c ) = 60 zcal/annulus. (82) 



D. Some simple uniform deformations and some relations between moduli 

For the purpose of defining various moduli, it is convenient to introduce the free energy 
per unit reference length (the reference being the contracted state) by defining 

c (p, 7 ) = l0(p,A). (83) 

^c 

Second derivatives of c (p, A) with respect to the pair (p, A) have interpretations as various 
moduli. For example, if we consider small deformations about, say, contracted sheath, then 
we write 

MP, A) = € + \ {A(p - Pc f + 2B(p - Pc )(A - Ac) + C(X - A c ) 2 ) + . . . , (84) 

where (j)° c is the free energy per unit length of undistorted contracted sheath. We assume 
this form is positive-definite. 

We now interpret these moduli A, B, C. Working within the ID ansatz, suppose we hold 
the annulus j = 1 fixed and apply an axial force f = /e3 to annulus j = 23, treated as a 
dead load. Then the total energy of sheath and loading device is 

*(yi,i, Ri i , • • • , y 6 ,23, R-6,23) - yija • f = *(yi,u R.1,1 , • • • , ye,23, R.6,23) - 22 / A 23 (85) 

(Recall the relation between Xj and Xj, equation Using the assumptions (|75jl and the 

argument just preceding Ij77j) . we see that the minimizing state {pi, . . . , P23}, {Ai, . . . , A23} 
is uniform, 

Pi = ■ ■ • = P23 = p, Ai = • • • = A 23 = A, (86) 



and (p, A) minimizes 

c (p,A)-/A (87) 
Minimizing this expression over (p, A) for small values of /, we get 

X ~ Xc = X C (AC-B 2 ) f + " • ' 

- ~ B t + 
9 pc ~ \ C {AC - B 2 ) 1 + ' ' ' ' 

The tensile modulus (i.e., the proportionality factor between / and (22A — 22A C )/22A C ) is 
therefore 

X 2 C (AC-B 2 ) 

tensile modulus = — . (89) 



Hence, due to the positive-definiteness of the quadratic form (|84|). tensile force produces 
extension, and also twist, with an end angle that decreases with increasing force. We expect 
B > in which case the Poisson effect is the usual one: lengthening produces a decrease 
in the radius. We can define a "Poisson's ratio" via the usual formula (—radial strain/axial 
strain): 

Poisson's ratio = — — (90) 
p c A 

For simple torsion defined by the loading device energy — 22M7, where Me% is the applied 
moment, energy minimization of <p c — M(7/A c ), analogously to the above, leads to uniformity 
and to the equations 

A\ c 

7 7c " llpi ff {AC-B 2 ) M + "--' 

From here, we identify 

... , . moment -f 2 p 4 eff (AC - B 2 ) 

torsional modulus = = — — (92) 

twist /length A v ' 

Thus we predict that the torsional modulus is proportional to the tensile modulus, the 
proportionality factor only depending on the geometry of contracted sheath. In contrast, 

A A 



in macroscopic elasticity the torsional and tensile moduli are governed by different elastic 
constants (i.e., the shear and Young's moduli, respectively). This unusual behavior arises 
from our unusual constraints. 

Finally, we briefly consider the resistance of protein structures to internal pressure. This 
may be relevant to the interactions between the sheath and tail tube, so in fact it is more 
related to extended sheath (It is also of course highly relevant to the packaging of DNA in 
capsids [18]). For internal pressure p, the associated loading device energy is 23\irp 2 p. It is 
trivial to work out the associated moduli so we do not record that here. 

What is more interesting is to work out the reaction forces. When a body is constrained, 
there should be reaction forces, that is, certain kinds of forces that do not produce defor- 
mation. In the present setting but in the fully nonlinear case, we consider a sheath subject 
to a tensile force /, a twisting moment M and an internal pressure p altogether, with an 
associated loading device energy -22/A - 22M7 + 22Xnp 2 p. Taking the first variation of 
the energy with respect to (p, 7) we see that the resulting two equations do not uniquely 
determine the three unknowns (f,M,p). In fact it turns out that changes of p generically 
lead to deformation and the constraint force only involves / and M. The result can be 
stated in the following way. Suppose that the sheath is in equilibrium at a state (p, 7) corre- 
sponding to generalized forces (/, M,p). Then, (p, 7) is in equilibrium if (/, M) are changed 
to (/ + A, M + Mi), where, 

M t = h = f, (93) 

Alternatively, this condition can be thought of in terms of work: changes of force and 
moment consistent with (|93|) do no work on the sheath. 



IX. RELAXED STATES 



It is interesting to contrast our theory of the protein sheet with nonlinear continuum 
theories of plates and shells. As a related example, the mechanical behavior of carbon nan- 
otubes have been shown to conform to such continuum theories in many aspects, especially 
regarding elasticity and buckling ^| . The results given above arising from the constraints, 
especially the first order Poynting effect and the relations between elastic moduli, suggest 

A K 



differently, but these results are closely connected with the presence of the constraints. As 
we show here, the predictions of the unconstrained theory are also essentially different from 
continuum theories. This is not fundamentally a "nanoscale" phenomenon, but is related to 
the particular structure of protein sheet: compact globular proteins with local bonding and 
a sensitivity to orientation. 

To review, nonlinear continuum theories of thin plates and shells come in various varieties, 
depending on the strength of the applied forces (for a rigorous treatment and an overview of 
the regimes, see [20]). For the largest applied forces there is membrane theory, defined in the 
following way: let <p(G) be the three-dimensional nonlinear elastic energy of the material 
expressed as a function of the deformation gradient G. We suppose as usual that <p is 
frame-indifferent, <^(QG) = ip(G) for all Q G SO(3) and that p> is minimized on SO(3). Let 
(e 1; e 2 , e 3 ) be an orthonormal basis with e 3 normal to the plate in its reference configuration, 
and write G = in this basis. To describe membrane theory we express the deformation 
gradient as its three column vectors, G = (yi|y2|ys)- If yt^i,^), (xi,x 2 ) G S is the 
deformation of the plate then, in the absence of a loading device, the energy is 



This is minimized over the independent fields (y(xi, x 2 ), b(xi, x 2 )) (b describes deformations 
relative to the "middle surface"). Suppose that we have no boundary conditions imposed. 
Then, the energy in (|94)l is minimized by (y, b) satisfying 

y,i= R(x 1 ,x 2 )e 1 , y, 2 = R(xi, x 2 )e 2 , h = R(x 1 ,x 2 )e 3 , R(xi, x 2 ) G SO(3). (95) 

The third of these equations simply determines b, while the first two restrict the deformation. 
In fact, the first two of these equations define so-called isometric mappings. Isometric 
mappings are essentially the mappings that one can illustrate by taking a flat sheet of paper 
and deforming it, including the possibility of making folds and rather complex "crumpling". 
At the next level of approximation, for weaker forces, we have nonlinear bending theory. 
This is defined by the same kinematics as just described, but with the energy 



where ip(G) = mm c q(Gij + q 5 3j ), Sij is the Kronecker delta, and q is the quadratic form 




(94) 




(96) 




(97) 



In this case the energy ()97|) is minimized over isometric mappings only, i.e. over the solutions 
of (J95j) . Thus, in summary, isometric mappings are the basic relaxed states of plate theories: 
they are the zero energy deformations of membrane theory and the finite energy deformations 
of bending theory. Shell theories are variants of the above in which the given reference state 
is generally curved. In that case the finite energy deformations are isometric mappings of 
the curved reference state. 

What are the relaxed states of the present theory? To calculate the analog of the above, 
we should minimize the energy of the sheet 



*(yi,i, Ri,i , . . . , yjv,M,Rjv,M) (98) 
= ^i(yij) yj+ij) R-j+ij) + ^2(yjj,Rjj,yij+i,Rj,j+i), 

(M')ez 2 n£> 

without loading device energies or boundary conditions, over all positions and orientations. 
Here we have assumed two bonding directions and we have ignored boundary molecules. 
To simplify, this can be written (modulo possibly a few missing or additional boundary 
molecules) 

*= Yl MtijiQiJ+MtwQiJ), (") 

(i,j)ez 2 nv 

but now it must be born in mind that the independent variables (tjj, Qij, ti j, Qi,j) are 
subject to the compatibility conditions (J2U). These conditions, repeated here 



(100) 



QijQij+iQi+ijQij — I) 



couple molecules with {i + 1, j) and + 1). Clearly, we cannot (as we did above) 
minimize (ffilj) by minimizing the "integrand" iftifaj, Qij) + ^(tij, Qi,j), for this would 
typically give a minimizer, say, of the form 



H,j — t, 



tjj = t, 



> where < 



V>i(t,Q) < ^i(a,R) for all (a,R), 
Mt, Q) < 4> 2 (a, R) for all (a, R), 



(101) 



A<7 



and it is seen that such a minimizer would generically fail the compatibility conditions, 
which in this case become 

QQQ T Q T = I, 
t + Qt - Qt - t = 0. (102) 

One can consider more complex rearrangements of the sum (J99j) . with different "integrands" , 
but the analogous problem arises again. For example, the apparently most promising rear- 
rangement is the sum 

(i, j) e z 2 n p 

i + j = even 

(103) 

Here the summand contains exactly the independent variables appearing in the constraints, 
and therefore we could minimize it with respect to all values of the independent variables 
subject to the constraints. But one then sees that the solution actually satisfies only the 
compatibility conditions on every other cell and generically does not give a minimizer. 

In discrete theory the impossibility of minimizing the energy for each each bond indi- 
vidually is termed frustration. In continuum theory the concept is similar j^jj: it is the 
inability of minimizers of the energy density to satisfy conditions of compatibility inher- 
ent in the kinematics We can say that our sheet is also frustrated, in the sense that 
minimization of the energy density for each bond does not generically give a compatible de- 
formation. Here, the word "generically" means that, even if these compatibility conditions 
happen to be (accidentally) satisfied for a minimizer, then they are not satisfied if ipi or 
ip2 are smoothly perturbed consistent with all of their assumed symmetries. As indicated 
above, even if we allow small collections of multiple bonds and minimize the energy of these, 
subject to constraints of compatibility, we also obtain a configuration that is not compatible 
in the large. 

In biology, unlike materials science, there is the phenomenon of evolution of materials to 
achieve fitness. Thus for a protein sheet, there might be reasons, for example, to achieve 
a particularly low energy state, for a protein sheet to be nongeneric. Thus it is of interest 
to assume that t, Q and t, Q minimize, respectively, ipi, ip2 and also satisfy ()102j) and then 
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to see what kinds of sheets emerge. We call such states fully relaxed states: each bond is 
relaxed and the configuration is compatible. 

To calculate all fully relaxed states, we merely have to characterize all solutions of (jl()2j) 
and then calculate the implied positions and orientations. This is a straightforward algebraic 
exercise and we just give the results. First, a useful characterization of the solutions of (jl02j) 
is the following. Each solution falls into one of the categories below: 

1. Q and Q are coaxial, Qe = Qe = e, |e| = 1, and 

(a) If Q 7^ I and Q ^ I, then t — t± + re and t = ti + fe for some r, f with 
ti • e = tx • e = 0, and t x = (I — Q) _1 (I — Q)ti, the inverse taken on the plane 
perpendicular to e. 

(b) If Q ^ I and Q = I, then t is arbitrary but t = re for some r. 

(c) If Q = I and Q ^ I, then t is arbitrary but t = re for some r. 

(d) If Q = I and Q = I, then t and t are arbitrary. 

2. Q = — I + 2e ® e and Q = — I + 2e ® e, |e| = |e| = 1, e • e = 0, and 

(a) t = Tie + r(e x e), t = T^e + r(e x e) for some r 1; r 2 , r. 

Now, using these results we go back and compute the uniquely determined (up to overall 
rigid body motion) positions and orientations. For simplicity we assume anA^xM sheet. 
We find, in all cases, 



R-i+i.j+i — R-i,iQ*Q' J , 
yi+ij+i = yi,i + R-i.i 



Q fe t + q j Yl Q fct 

k=0 k=0 

i = l,...,N, j = l,...,M. (104) 



But this is exactly of the form of (J57|) for T4 tail sheath! (Note: recall that in the formula 
for tail sheath the R^i was moved through the Q*Q J using the remark given at the end of 
Section EJ). 

It is intriguing to ponder whether these fully relaxed states are actually realized by 
tail sheath (or other protein sheets) and, if so, the implications of this with regard to 



in 



stability, and evolutionary development. Of course, with the various choices of t, Q, t, Q 
as enumerated above, the sheet will not look exactly like tail sheath. We explored this 
numerically by choosing various cases and found that the general appearance is however 
much like tail sheath; in fact, it can be proved from the formula (jl()4j) that the bonding 
direction (i, j) — (i — l,j + 1) is also a helix j^] (with translation Q(t + 1)). Figure [TT] shows 
a generic picture of a fully relaxed state, with the molecule represented by a simple ellipsoid. 

To complete this story, we make brief remarks about the remaining cases of the enumer- 
ation above. In item 1, if either Q = I or Q = I then the appearance is still more or less 
like Figure but one family of helices degenerates to straight lines of molecules that are 
parallel to the axis of the cylinder. If both Q = I and Q = I, then the cylinder degenerates 
to a planar sheet, with crystalline symmetry, and parallel orientations of molecules. Finally, 
item 2 is a bit surprising; it describes a collection of four molecules, not generally at the 
corners of a regular tetrahedron, but such that each pair of the molecules is twinned, that 
is, individuals of the pair are related by a 180° rotation. 

In summary, our theory of a protein sheet is generically frustrated. Energy minimizers 
are generally naturally curved, as in shell theories, but this curvature is determined by the 
energy. Isometric mappings seem to play no role here. In our theory if one considers energy 
densities ^1,^2 that are minimized at compatible pairs (t,Q),(t, Q) (i.e., fully relaxed 
states) then the energy minimizers look much like the tail sheath of bacteriophage T4 and 
are given by simple formulas. 



X. EXPERIMENTS SUGGESTED BY THE THEORY 

We have noted above several places above where there are possible experimental tests 
of our predictions. These include the extension-twist relation (Figure IBJ), the linearized 
behavior near contracted or extended sheath ()89)) - (j92|) . and the reaction forces that preserve 
deformation (j^H)). We now discuss two other types of predictions that relate directly to 
biological and interesting nonbiological behavior. 




FIG. 11: A generic fully relaxed state. 



A. The force of penetration 



One of the most important predictions of our model is the force of penetration. Consider 
applying an axial force / to contracted sheath, so as to stress-induce the transformation 
to extended sheath. Alternatively, one can imagine applying sufficient tension to extended 
sheath to just prevent contraction, i.e., the stall force. We neglect the interactions with the 
tail tube, assuming it to be weakly bonded to the sheath even when it is in the extended 



state, as is thought to be true, |22|, 12[. For small values of / the behavior is given by 
the analysis of (|S5 |) -(|S9 |) . and we expect the initial slope of the force-elongation curve (/ 
vs. (A — A c )/A c ) given by the modulus \ 2 C (AC — B 2 )/A. There is expected to be significant 
nonlinearity of the response, because the constraints themselves are nonlinear. The details of 



the response near transition may depend on details of the loading device - whether hard or 
soft, for example - but one expects some kind of load drop on nucleation. The transformation 
is expected to take place via movement of an interface, as pictured qualitatively in Figure El 
and at approximately constant free energy, because a bias of free energy toward either phase 
would, by energy minimization, tend to drive out the interface, one way or the other. This 
suggests the criterion 

M>(y c , R c ) - 22 / \ c 23 = *(y e , R e ) - 22 / A* 3 , (105) 

where the uniform states (y c,e ,R c ' e ) = {y c {l, R^ , . . . , Ye 23)^623) are assumed to be in 
equilibrium. Let superscripted variables (A 6,c , p e ' c , 7 e,c ) be associated with these uniform 
equilibrium states. Using the special form of the energy (|79 j) - (jS3]) : we get 

c (p c , A c ) - / (A c /A c ) = c (p e , A e ) - / (A e /A c ), (106) 

where 

dMp c ' e , A c - e ) _ Q dMp c > e ,^ e ) = f_ (107) 



dp dX X 

Solving these together, we get, 

f f (0 c (p e ,A e )-0 c ( P c ,A c ))A c 

J - J trans _ ^ 

. i*M^ + (._^. r ), (108) 

The error term depends also on geometric factors and can be written explicitly, but we 
note that it is of the form (/ 2 /tensile modulus). Thus, if / <C tensile modulus then this 
term is negligible as compared to /, and the force at transformation is a simple ratio of the 
height difference between the energy well minima at extended and contracted sheath and 
the difference between the lengths of the sheath. These kinds of results are well known in 
the study of phase transformations. 

Of course, the virus uses the reverse transformation, from extended to contracted sheath, 
during penetration. The maximum force available for penetration is expected to be also 

f trans ■ 

We can evaluate the force of contraction based on the height difference between the wells, 
accounting for the reservations given at the end of Subsection IVIH CI Combining (|108J) and 



we get, 

f trans = 103 P N (109) 

By comparison, the stall force measured by laser tweezers during DNA packaging in 029 
was 57 pN Q. We would tend to think that the number 103 is a lower estimate for the 
actual force, because A0 is underestimated at 72 C as explained in Subsection IVIH 01 If we 
divide this force by the cross-sectional area of the sheath to get a stress, we get about 0.5 
MPa. This is quite low as a (maximum) transformation stress in a macroscopic crystalline 
martensitic material. However, the transformation strain in T4 sheath is enormous, so, if 
we calculate the energy density of contraction based on these numbers, we get numbers that 
are comparable to those measured in the best shape memory materials, which themselves 
exhibit the highest energy densities in any known actuator system j^J. 

The transformation can also be induced by applying a pure axial moment. This leads to 
the analogue of ()105|) - (jl08|) . except using the loading device energy — 22M7, and gives the 
moment at transformation of 

(0 c (p e ,A e )-0 c (p c ,A c ))A c 



M = M trans 



(7 e - 7 C ) 

(p e ,A e )-0 c ( Pc ,A c )))A c Q f A c , e m2 \ 

le-lc V c ' e A c>e C c „-Bl I 



c,e 



Neglecting the higher order terms in ()108|) and ()110|) we have the simple approximate re- 
lationship between the forces and moment needed to cause transformation in the sheath: 

ftrans ~ T 7 trans TT T M-trans- (11-0 

A e -A c 89 A 

B. Biomolecular epitaxy, patterning and devices 

In this section we explore some more speculative ideas. T4 tail sheath is a kind of 
biomolecular actuator, and one could imagine that it could function as part of a man-made 
machine j^j that could interact in an intimate way with biological organisms. Tubes of 



B Q; 



polysheath is similar to, 



polysheath several microns long have been synthesized 
but not exactly the same as, contracted sheath. Tubes of extended sheath have not been 
synthesized separate from the the baseplate and tail tube, and this is understandable in view 




FIG. 12: Edge-on view, slightly below horizontal, of the interface between contracted (left) and 
extended (right) sheath. 

of their higher free energy and also the possible role of the baseplate in stabilizing extended 
sheath via epitaxy. 

One basic interesting line of thought is to consider the possibility of changing the heights 
of the energy wells. As is true in a great many biological systems, hydrophobicity plays a 
critical role, and this can be appreciated in the present case by looking at Figure |3J There, 
it is clear by inspection that extended sheath exposes substantially more surface area to the 
surrounding solution than contracted sheath, and this qualitatively explains its higher free 
energy. But it also indicates that the relative free energies of extended and contracted sheath 
are amenable to adjustment via manipulation of the solution chemistry. Systematic studies 
2(| of the effect of solution chemistry on the breakup of parts of the virus (capsid, neck, 
tail sheath, tail tube, baseplate, tail fibers) demonstrate sensitivity to solution chemistry. 
Apparently, solutions that cause contracted tail sheath to extend have not yet been found. 
However, if by this means one could exchange the heights of the wells, then tail sheath would 
be like a shape memory material. In a highly schematic way, one could alter the solution 
so that extended sheath is stable. Then one could add an axial tensile force to the sheath. 
Again manipulating the solution, one could return it to phage-physiological conditions and, 
if the force was not too large (i.e., below the value f trans of (|1U8|1 ) then the sheath would 
transform to the contracted form, while doing work on the force. This would be a machine 
that converts chemical free energy of the solution to mechanical energy. The very small 
cross-section of the sheath would allow it to target a small region of a cell. One could 
consider the possibility of vast arrays of these tubes. In this regard, we note that ordered 
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FIG. 13: The two compatible interfaces that separate extended (above) from contracted (below) 
sheath. In each case the dark line of atoms was the main helix before unrolling the sheath. 

planar arrays of whole viruses have been deposited on surfaces (not using epitaxy) by Lee 
et al. Q 

In the following discussion we allow rather drastic changes of the sheath, but we enforce 
the constraints (|59|) and (|62[). these being in our view fundamental to its behavior. T4 sheath 
is in the shape of a cylinder, but it is interesting to think about the possibility of slitting along 
a generator and unrolling it. We first note that it is possible to do this without violating the 
constraints. Secondly, extended tail sheath exhibits an epitaxial relation to the baseplate, 
and this likely plays a role in self-assembly of the sheath during formation and subsequent 
stabilization. Thus we suggest the possibility of growing films of tail sheath epitaxially. In 
general, epitaxial growth is aided by a substrate with the same lattice parameters as the 
sheath, that is also chemically compatible with the sheath. The most likely possibility is 
to grow the lower free energy contracted form (see the left of Figure which shows the 
epitaxial surface). It is interesting to note that epitaxial growth of protein sheets could 
possibly take advantage of the shapes of molecules and the presence of functional groups, 
in addition to the matching of lattice parameters and use of surface chemistry, the latter 
principles familiar from the epitaxial growth of semiconductors. 

While it does not violate the constraints, unrolling is a pretty drastic distortion, so one 
can expect some deviation of the lattice vectors from the values tQ £ ,t c ' e of Section IVHl 
Nevertheless, in the analysis below, we do use those values, together with the orientations 
Ri'i- Without loss of generality (using frame-indifference) we first rotate these vectors into 
the 1,2-plane (we do not relabel the resulting vectors) and we apply the same rotations to 



FIG. 14: Shear- inducing the transformation from contracted to extended sheath. Drawn with the 
lattice parameters of extended and contracted sheath but with molecules represented by dots. The 
molecules are released from the substrate except for the two dark strips at the top and bottom of 
the sheet. 

the orientations. We plot the sheet as ito + jt where % and j are integers. 
The transformation matrix G that maps tQ,t c into tg, t e is the matrix 



By direct calculation G T G has eigenvalues 2.06, 0.567. As is known from the theory of 
martensitic transformations in sheets j^J, the fact that these values straddle 1 (i.e., 2.06 > 
1 > 0.567) means that there are exactly two interfaces on the sheet where extended and 
contracted sheet meet compatibly. These are pictured in Figure ED We have rotated the 
sheets suitably so that the interfaces are horizontal. The original orientation can be inferred 
from the dark lines of atoms, which correspond to what was the main helix (cf. Figure EJ) 
before unrolling. A cross section of the interface on the right is seen in Figure 

Note that on the left of Figure IT31 the dark line of atoms is approximately in the direction 
of the interface. This reflects the constraint, which embodies the idea that the two phases are 
approximately equally stretched along this line. The reason that this line does not exactly 
coincide with the interface is related to the use of the effective radius, rather than the actual 
radius, in (j59jl . As mentioned above, the values of lattice parameters are likely to change a 
bit with "unrolling" , leading to interfaces that differ somewhat from those shown in Figure 



From Figure EH one can imagine the possibility of stress- inducing the transformation by 
shear as shown in Figure El This would provide a direct measure of the relative heights 




(112) 



of the energy wells and therefore of the contraction force. Ideally, one could begin with an 
epitaxially grown sheet, as discussed above, and release the film from the substrate on the 
medium gra y r egion of Figure El some of the techniques developed in the microactuator 
community [28] (such as backside etching) for patterning and releasing single crystal films 
could be relevant. Then by applying shear and slight extension as shown in Figure ITU the 
phase transformation could be made to occur. Technically, the corners between phases may 
introduce stress concentrations in such an experiment, but this can be overcome by using 
a suitable indentor that induces an appropriate out-of-plane deformation. Once again, it 
would be fascinating to bring chemistry into such an experiment by altering the solution 
around the sheet. 

APPENDIX A: EQUATIONS FOR THE MOMENTS PRODUCED BY HELICAL 
CONFIGURATIONS 

This is a proof of (j41j) o and (j42j) . Let i G {2, . . . , N — 1}. The terms of the energy (|35)1 
that contain Rj are 



In this expression replace by (I + sW + . . . )Rj, W T = — W, differentiate with respect 
to s and evaluate at s = 0. This gives (|4*T])-> together with the formula 



Components are with respect to the rectangular Cartesian orthonormal basis used in the 
paper. So, we need to show that (|A2jl reduces to (JI2J). In (|A2|) write R^+i = HeQe and 
define 



(Al) 




(A2) 




(A3) 



where, for any matrix A, SkewA = |(A — A T ). With this definition (jA2j) becomes 



(R-ARj) 



(A4) 



Recalling the notation (J3HJ), (jSll), we have 

dip(t e , Q e ) 




(A5) 



Multiply this by \e S jt and simplify to get, 

1 dip(t e , Q e ) 
(StU = -e sjt — . (A6) 

Now use the identity EjkiRij = Ei pq R p kR q i which holds in an orthonormal basis for any R £ 
SO (3) (i.e., invariance of the cross product under rotations) in the component version of 
£1: 



(R^)is(S|) s t(R|)jfct, 
(R^)j7 £slt {Si) st, 

{Re)sl dw, ' (A7) 



the last step follows from (jA5|) . This is 



APPENDIX B: APPROXIMATION OF ELECTRON DENSITY MAPS 

Many arguments of this paper relied on the approximate shape and orientation of the 
molecules. In order to have a reasonable but fairly simple representation of the molecules 
of tail sheath, we approximated the electron density maps of Leiman et al. (jllj; we are 
grateful to Petr Leiman for providing prepublication data from high resolution cryo-electron 
micrographs of extended sheath). Information about how the positions and orientations 
were extracted from the representations is described at the end of Section IVH Al 

The maps themselves showed clearly the presence of domains. These have the general 
appearance of nestled ellipsoids, so we approximated them by overlapping ellipsoids. We 
did this by partitioning the data by domain (the colored regions of Figure ITK|). Then we 
computed the total electronic charge and center of mass of each domain. Using charge 
neutrality we made mass density proportional to electronic density. This can have errors 
arising mainly from the presence of H atoms, but in fact the total charge of a domain (or 
molecule) in contracted vs. extended state differed by more than this error, so the quality 
of the data did not justify a more detailed analysis. 





FIG. 15: Comparison of electron density maps and ellipsoidal approximations. See text. 

With the center of mass of the domain now fixed, we adjusted the principle axes of the 
ellipsoids to match approximately the sectional data. Sections of the selected ellipsoids are 
shown in Figure ITB1 superimposed on the data. In this figure the concentric circles define the 
axis of the tail sheath and the Z values indicate the slices of the electron density map which 
were averaged. Here Z is an axial variable measured from a fixed (but arbitrary) reference. 
Domains for both extended and contracted sheath are shown. The ellipses are sections of 
the ellipsoid (for the enclosed domain) at the corresponding average value of Z (i.e., for Z= 
to 10 the slice through the ellipsoid was taken at Z = 5). 
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